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Abstract 



At the present time, most physicists continue to hold a skeptical attitude to- 
ward the proposition of a 'hidden variables' interpretation of quantum theory, 
in spite of David Bohm's successful construction of such a theory and John S. 
Bell's strong arguments in favor of the idea. Many are convinced either that 
it is impossible to interpret quantum theory in this way, or that such an in- 
terpretation would actually be irrelevant. There are essentially two reasons 
behind such doubts. The first concerns certain mathematical theorems (von 
Neumann's, Gleason's, Kochen and Specker's, and Bell's) which can be applied 
to the hidden variables issue. These theorems are often credited with proving 
that hidden variables are indeed 'impossible', in the sense that they cannot 
replicate the predictions of quantum mechanics. Many who do not draw such a 
strong conclusion nevertheless accept that hidden variables have been shown to 
exhibit prohibitively complicated features. The second reason hidden variables 
are disregarded is that the most sophisticated example of a hidden variables 
theory — that of David Bohm — exhibits nonlocality, i.e., it can happen in this 
theory that the consequences of events at one place propagate to other places 
instantaneously. However, as we shall show in the present work, neither the 
mathematical theorems in question nor the attribute of nonlocality serve to 
detract from the importance of a hidden variables interpretation of quantum 
theory. The theorems imply neither that hidden variables are impossible nor 
that they must be overly complex. As regards nonlocality, this feature is present 
in quantum mechanics itself, and is a required characteristic of any theory that 
agrees with the quantum mechanical predictions. 

In the present work, the hidden variables issue is addressed in the following 
ways. We first discuss the earliest analysis of hidden variables — that of von 
Neiimann's theorem and r(^vi(nv .John S. Bell's refutation of von Neiimann's 
'impossibility proof. We recall and elaborate on Bell's arguments regarding the 
theorems of Gleason, and Kochen and Specker. According to Bell, these latter 
theorems do not imply that hidden variables interpretations are untenable, but 
instead that such theories must exhibit contextuality, i.e., they must allow for 
the dependence of measurement results on the characteristics of both measured 
system and measuring apparatus. We demonstrate a new way to understand the 
implications of both Gleason's theorem and Kochen and Specker's theorem by 
noting that they prove a result we call "spectral incompatibility" . We develop 
further insight into the concepts involved in these two theorems by investigating 
a special quantum mechanical experiment which was first described by David 
Albert. We review the Einstein-Podolsky-Rosen paradox. Bell's theorem, and 
Bell's later argument that these imply that quantum mechanics is irreducibly 
nonlocal. We present this discussion in a somewhat more gradual fashion than 
does Bell so that the logic of the argument may be more transparent. 



The paradox of Einstein, Podolsky, and Rosen was generalized by Erwin 
Schrodinger in the same paper where his famous 'cat paradox' appeared. We de- 
velop several new results regarding this generalization. We show that Schrodinger's 
conclusions can be derived using a simpler argument — one which makes clear 
the relationship between the quantum state and the 'perfect correlations' exhib- 
ited by the system. We use Schrodinger's EPR analysis to derive a wide variety 
of new quantum nonlocality proofs. These proofs share two important features 
with that of Greenberger, Horne, and Zeilinger. First, they are of a determinis- 
tic character, i.e., they are 'nonlocality without inequalities' proofs. Second, as 
we shall show, the quantum nonlocality results we develop may be experimen- 
tally verified in such a way that one need only observe the 'perfect correlations' 
between the appropriate observables; no further tests are required. This latter 
feature serves to contrast these proofs with EPR/Bell nonlocality, the laboratory 
confirmation of which demands not only the observation of perfect correlations, 
but also an additional set of observations, namely those required to test whether 
'Bell's inequality' is violated. The 'Schrodinger nonlocality' proofs we give differ 
from the GHZ proof in that they apply to two-component composite systems, 
while the latter involves a composite system of at least three-components. In 
addition, some of the Schrodinger proofs involve classes of observables larger 
than that addressed in the GHZ proof. 
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V 



"Attention has recently been called to the obvious but very dis- 
concerting fact that even though we restrict the disentangling mea- 
surements to one system, the representative obtained for the other 
system is by no means independent of the particular choice of obser- 
vations which we select for that purpose and which by the way are 
entirely arbitrary." 

Erwin Schrodinger [92] 
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Chapter 1 

Introduction 



1.1 The issue of hidden variables 

1.1.1 Contextuality, nonlocality, and hidden vsiriables 

The aim of this thesis is to contribute to the issue of hidden variables as a viable 
interpretation of quantum mechanics. Our efforts will be directed toward the 
examination of certain mathematical theorems that are relevant to this issue. 
We shall examine the relationship these theorems bear toward hidden variables 
and the lessons they provide regarding quantum mechanics itself. The theorems 
in question include those of John von Neumann [101], A. M. Gleason [57], J.S. 
Bell [7], and S. Kochen and E. P. Specker [74]. Arguments given by .John Stewart 
Bell^ demonstrate that the prevailing view that these results disprove hidden 
variables^ is actually a false one. According to Bell, what is shown^ by these 
theorems is that hidden variables must allow for two important and fundamental 

^See references by Bell: [8, 16] on the theorems of von Neumann, Gleason, and Kochen 
and Specker. Bell's works emphasize the conclusion that these theorems would place no 
serious limitations on a hidden variables theory, and he declares that [16] "What is proved by 
impossibility proofs [these theorems] is lack of imagination." A recent work by Mermin 

[80] also addresses the impact of these theorems on hidden variables. Mermin does not accept 
Bell's conclusions, however, and states "Bell ... is unreasonably dismissive of the importance 
of . . . impossibility proofs [theorems] . ... [Bell's] criticism [of these theorems] undervalues 
the importance of defining limits to what speculative theories can or cannot be expected to 
accomplish." In the present work, we argue for Bell's position on these matters. 

^The following references found in the bibliography present discussions of all four theorems 
along with their relationship to hidden variables: [6, 69, 70, 80]. Von Neumann's theorem is 
claimed to disprove hidden variables in [2, 35, 71, 101]. Tliis conclusion is reached for both 
Gleason's theorem and the theorem of Kochen and Specker in [74, 8.3] . The following references 
state that Bell's theorem may be regarded in this way: [24] ,[55, p. 172], [104]. 

^The theorem due to von Neumann is not as closely connected with these concepts as the 
others. We include it for the saJce of completeness and as introduction to the discussion of the 
other theorems. In addition, we will point out the presence in a work by Schrodinger [91] of 
an analysis leading to essentially the same conclusion as von Neumann's. 
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quantum features: contextuality and nonlocality. 

To allow for contextuality, one must consider the results of a measurement 
to depend on the attributes of both the system, and the measuring apparatus. 
The concept of contextuality is evidently in contrast to the tendency to consider 
the quantum system in isolation, without considering the configuration of the 
experimental apparatus^. Such a tendency conflicts with the lesson of Niels Bohr 
concerning [90, page 210] "the impossibility of any sharp separation between the 
behavior of atomic objects and the interaction with the measuring instruments 
which serve to define the conditions under which the phenomena appear." In 
any attempt to develop a theory of hidden variables, one must take contextuality 
into account. 

The concept of contextuality lies at the heart of both Gleason's theorem and 
the theorem of Kochen and Specker. However, typical expositions^ of these ar- 
guments give either no discussion of the concept, or else a very brief treatment 
that fails to convey its full meaning and importance. The conclusion drawn by 
authors giving no discussion of contextuality is that the mathematical impossi- 
bility of hidden variables has been proved. References which give only a brief 
mention of the concept tend to leave their readers with the impression that it 
is not of much significance and is somewhat artificial. Readers of these works 
would be led to the conclusion that Gleason's and Kochen and Specker's the- 
orems demonstrate that the prospect of hidden variables is a dubious one, at 
best. 

The concept of nonlocality is well known through Bell's famous mathemat- 
ical theorem [7] in which he addresses the problem of the Einstein-Podolsky- 
Roscn paradox^. In a system exhibiting nonlocality, the consequences of events 
at one place propagate to other places instantaneously. Although Einstein, 
Podolsky and Rosen were attempting to demonstrate a different conclusion (the 
incompleteness of the quantum theory), their analysis served to point out the 
conditions under which (as would become evident after Bell's work) nonlocality 
arises. The existence of such conditions in quantum mechanics was regarded by 
Erwin Schrodinger as being of great significance, and in a work [92] in which 
he generalized the EPR argument, he called these conditions "f/ie characteris- 
tic trait of quantum mechanics, the one that forces its entire departure from 
classical lines." Bell's work essentially completed the proof that the quantum 
phenomenon discovered by EPR and elaborated by Schrodinger truly does entail 
nonlocality. 

Although the EPR paradox and Bell's theorem are quite well known, there 

*See papers in the recent book by Bell [8, 16] for a good discussion of contextuality. 

®The exception to this is two works by Bell: bibliography references [8, 16]. In these 
works, contextuality is made quite clear, and it is concluded that neither theorem proves the 
impossibility of hidden variables. References that discuss the Kochen and Specker theorem 
without treating contextuality are [74, 78, 83]. The following references briefly discuss the 
relationship of Gleason's and Kochen and Specker's theorems to contextuality, but fail to 
convey its importance: [6, 69, 70]. 

^The EPR paper appears in [51], and it is reprinted in [103]. 



2 



exists a misperception regarding the relationship these arguments bear to non- 
locality. Some authors (mistakenly) conclude'' that the EPR paradox and Bell's 
theorem imply that a conflict exists only between local theories of hidden vari- 
ables and quantum mechanics, when a more general conclusion than this follows. 
When taken together, the EPR paradox^ and Bell's theorem imply that any lo- 
cal theoretical explanation whatsoever must disagree with quantum mechanics. 
In Bell's words: [14] "It now seems that the non-locality is deeply rooted in 
quantum mechanics itself and will persist in any completion." We can con- 
clude from the EPR paradox and Bell's theorem that the quantum theory is 
irreducibly nonlocal^. 

As we have mentioned, Erwin Schrodinger regarded the situation brought 
out in the Einstein, Podolsky, Rosen paper as a very important problem. His 
own work published just after the EPR paper [91] provided a generalization of 
this analysis. This work contains Schrodinger's presentation of the well known 
paradox of 'Schrodinger's cat'. However, a close look at the paper reveals much 
more. Schrodinger's work contains a very extensive and thought provoking anal- 
ysis of quantum theory. He begins with a statement of the nature of theoretical 
modeling and a comparison of this to the framework of quantum theory. He 
continues with the cat paradox, the measurement problem, and finally his gen- 
eralization of the EPR paradox and its implications. In addition, he derives a 
result quite similar to that of von Neumann's theorem-'^'^. 

Subsequent to our discussion of the issues of the EPR paradox. Bell's theo- 
rem, and nonlocality, we will discuss a new form of quantum nonlocality proof^^ 
based on Schrodinger's generalization of the EPR paradox. The type of argu- 
ment we shall present falls into the category of a "nonlocality without inequal- 
ities" proof^^. Such analyses — of which the first^'' to be developed was that of 
Grecnbergcr, Horne, and Zcilinger [60] — differ from the nonlocality proof de- 
rived from the EPR paradox and Bell's theorem in that they do not possess the 
statistical character of the latter. 

^Scc for example, [40, p 48], [62, 80, 83]. Clauser and Shimony [38] regard the EPR paradox 
and Bell's theorem as proof that quantum mechanics must conflict with any 'realistic' local 
theory. Their view seems to fall short of Bell's (see Bell [12, 15]) contention that it is locality 
itself which leads to this clash with quantum theory. 

^Bohm's spin singlet version [25] of EPR. 

9See the following: [1, 12, 15, 30, 64, 77, 85]. 

-'^'^ Schrodinger does not explicitly mention von Neumann's work, however. 

^^A nonlocality proof similar to that presented in this thesis was developed by Heywood and 
Redhead [66] and by Brown and Svetlichny [36]. These authors present arguments addressed 
to one particular example of the general class of maximally entangled states we consider. 
Their argumentation does not bring out all the conclusions we do in the present work, as 
they do not draw the general conclusion of nonlocality, but instead only the nonlocality of the 
particular class of hidden variables they address. 

^^See Greenberger, Horne, Shimony and Zeilinger in [61] 

^^See Mermin in [79] 



3 



1.1.2 David Bohm's theory of hidden variables 



It must be mentioned that the problem of whether hidden variables are possible 
is more than just conjectural. Louis de Broglie's 1926 'Pilot Wave' theory^^ is, 
in fact, a viable interpretation of quantum phenomena. The theory was more 
systematically presented by David Bohm in 1952^^. This encouraged de Broglie 
himself to take up the idea again. We shall refer to the theory as "Bohmian 
mechanics" , after David Bohm. 

The validity of Bohmian mechanics is a result of the first importance, since it 
restores objectivity to the description of quantum phenomena^^. The quantum 
formalism addresses only the properties of quantum systems as they appear 
when the system is measured. An objective description, by contrast, must give 
a measurement-independent description of the system's properties. The lack of 
objectivity in quantum theory is something which Albert Einstein disliked, as 
is clear from his writings: [90, page 667] 

What does not satisfy me, from the standpoint of principle, is 
(quantum theory's) attitude toward that which appears to be the 
programmatic; aim of all physics: the complete description of any 
(individual) real situation (as it supposedly exists irrespective of 
any act of observation or substantiation). 

Moreover, Einstein saw quantum theory as incomplete, in the sense that it omits 
essential elements from its description of state: [90, page 666] "I am, in fact, 
firmly convinced that the essentially statistical character of contemporary quan- 
tum theory is solely to be ascribed to the fact that this (theory) operates with an 
incomplete description of physical systems." Bohmian mechanics essentially ful- 
fills Einstein's goals for quantum physics: to complete the quantum description 
and to restore objectivity to the theoretical picture. 

Besides the feature of objectivity, another reason the success of Bohmian 
mechanics is important is that it restores determinism. Within this theory one 
may predict, from the present state of the system, its form at any subsequent 
time. In particular, the results of quantum measurements are so determined 
from the present state. Determinism is the feature with which the hidden vari- 
ables program has been traditionally associated^^. 

^*See [41, 42, 43]. A summajry of the early development of the theory is given by de BrogUe 
in [44]. 

^^Sec [28]. See also Bell in [16, 19] A recent book on Bohmian mechanics is [68]. 

^®It is for this reason that Bohm refers to his theory as an 'Ontological Interpretation' of 
quantum mechanics: [32, page 2] "Ontology is concerned primarily with that which is and 
only secondarily with how wc obtain our knowledge about this. We have chosen as the subtitle 
of our book "An Ontological Interpretation of Quantum Theory" because it gives the clearest 
and most accurate description of what the book is about.... the primary question is whether 
wc can have an adequate conception of the reality of a quantum system, be this causal or be 
it stochastic or be it of any other nature." 

^^This is the motivation mentioned by von Neumann in his no-hidden variables proof. See 
reference [101] 
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The importance of the existence of a successful theory of hidden variables 
in the form of Bohmian mechanics is perhaps expressed most succinctly by J.S. 
Bell: [16] 

Why is the pilot wave picture [Bohmian mechanics] ignored in 
text books? Should it not be taught, not as the only way, but as 
an antidote to the prevailing complacency? To show that vagueness, 
subjectivity, and indeterminism are not forced upon us by experi- 
mental facts, but by deliberate theoretical choice? 

1.2 Original results to be presented 

The analysis we present will consist of two main parts: one devoted to contex- 
tuality and the theorems of Gleason, and Kochen and Specker, and the other 
devoted to nonlocality, Bell's theorem, and Schrodinger's generalization of the 
Einstein-Podolsky-Rosen paradox. We will review and elaborate on the reasons 
why these theorems do not disprove hidden variables, but should instead be re- 
garded as illustrations of contextuality and nonlocality. Our presentation will 
link topics that have most often been addressed in isolation, as we relate the 
various mathematical theorems to one another. Our analysis of contextuality 
and the theorems of Gleason and Kochen and Specker goes beyond that of other 
authors^^ in the following ways. First, we will show that the theorems of Glea- 
son, and Kochen and Specker imply a result we call "spectral-incompatibility" . 
Regarded as proofs of this result, the implications of these theorems toward hid- 
den variables become much more transparent. Second, through discussion of an 
experimental procedure first given by David Albert [1] , we show that contextu- 
ality provides an indication that the role traditionally ascribed to the Hermitian 
operators in the description of a quantum system is not tenable^^. 

After reviewing the EPR paradox. Bell's theorem, and the argument for 
quantum nonlocality that follows, we then present what is perhaps the most 
distinctive feature of the thesis, as we focus on Erwin Schrodinger's generaliza- 
tion of the EPR paradox. First, we show that the conclusions of 'Schrodinger's 
paradox' may be reached using a more transparent method than Schrodinger's 
which allows one to relate the form of the quantum state to the perfect cor- 
relations it exhibits. Second, we use Schrodinger's paradox to derive a broad 
spectrum of new instances of quantum nonlocality. Like the proof given by 
Greenberger, Horne, and Zeilinger, these 'Schrodinger nonlocality' proofs are of 
a deterministic character, i.e., they are 'nonlocality without inequalities' proofs. 
The quantmu nonlocality result based on the EPR paradox and Bell's theo- 
rem is, on the other hand, essentially statistical, since it concerns the average 

l^Bell [8, 16] and Mermin [80]. 

^^A work by Daumer, Diirr, Goldstein, and Zanghi ([39]) argues for the same conclusion, 
but does not do so by focussing on the Albert experiment. 
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of the results from a series of quantum measurements. We demonstrate that 
just as is true for the GHZ nonlocaUty, Schrodinger nonlocahty differs from the 
EPR/BcU demonstration in that it may be experimentally confirmed simply by 
observing the 'perfect correlations' between the relevant observables. To verify 
the EPR/Bell nonlocality result, on the other hand, one must observe not only 
that the perfect correlations exist, but also that Bell's inequality is violated^'^. 
The test of Schrodinger nonlocality is simpler than that of GHZ in that the per- 
fect correlations in question are between observables of two subsystems, rather 
than between those of three or more. Moreover, some of the Schrodinger nonlo- 
cality proofs are stronger than the GHZ proof in that they involve larger classes 
of observables than that addressed by the latter. 

1.3 Review of the formalism of quantum me- 
chanics 

1.3.1 The state and its evolution 

The formalism of a physical theory typically consists of two parts. The first de- 
scribes the representation of the state of the system; the second, the time evolu- 
tion of the state. The quantum formalism contains besides these a prescription^^ 
for the results of measurement. 

Wc first discuss the quantum formalism's representation of state. The quan- 
tum formalism associates with every system a Hilbert space Ti. and represents 
the state of the system by a vector in that Hilbert space. The vector is 
referred to as the wave funcUon. This is in contrast to the classical descrip- 
tion of state which for a system of particles is represented by the coordinates 
{(/i; 92, 93, •••} and momenta {pi,p2,P3, •••}• We shall often use the Dirac no- 
tation: a Hilbert space vector may be written as , and the inner product 
of two vectors V'2) as {tpi \ tp2)- The quantum mechanical state tp in 

the case of a non-relativistic A''-particle system is given^^ by a complex-valued 
function on configuration space, ^/'(q), which is an element of L2 the Hilbert 
space of square-integrable functions. Here q represents a point {qi,q2,-..} in 
the configuration space TR^^ . The fact that ip is an element of L2 implies 



where dq = (dqidq2...). The physical state is defined only up to a multiplicative 
constant, i.e., c|V') and |^) (where c 7^ 0) represent the same state. We may 

20 See [3, 4, 5]. 

^^It is the existence of such rules in the quantum mechanical formalism that marks its 
departure from an objective theory, as we discussed above. 
^^In the absence of spin. 




(1.1) 
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therefore choose to normahze ip so that 1.1 becomes 

(V I V) = 1- (1-2) 

The time evolution of the quantum mechanical wave function is governed by 
Schrodinger's equation: 

th^=H^j^ (1.3) 

where H is an operator whose form depends on the nature of the system and 
in particular on whether or not it is relativistic. To indicate its dependence on 

time, wc write the wave function as ijji . For the case of a non-relativistic system, 
and in the absence of spin, the Hamiltonian takes the form 

3 ■' 

where V{q) is the potential energy^"^ of the system. With the time evolution so 
specified, ipt may be determined from its form f/^tu at some previous time. 



1.3.2 Rules of measurement 

As stated above, the description of the state and its evolution does not consti- 
tute the entire quantum formalism. The wave function provides only a formal 
description and does not by itself make contact with the properties of the sys- 
tem. Using onlyihe wave function and its evolution, we cannot make predictions 
about the typical systems in which we are interested, such as the electrons in an 
atom, the conduction electrons of a metal, and photons of light. The connec- 
tion of the wave function to any physical properties is made through the rules 
of measurement. Because the physical properties in quantum theory are defined 
through measurement, or observation, they are referred to as 'observables'. The 
quantum formalism represents the observables by Hermitian operators^^ on the 
system Hilbert space. 

Associated with any observable O is a set of eigenvectors and eigenvalues. 
These quantities are defined by the relationship 

0\4>)=ii\4>), (1-5) 

where is a real constant. We refer to \(j)) as the eigenvector corresponding to the 
eigenvalue /i. We label the set of eigenvalues so defined as {^Jia}■ To each member 
of this set, there corresponds a set of eigenvectors, all of which are elements of 
a subspace of H, sometimes referred to as the eigenspace belonging to fia- We 
label this subspace as Ha- Any two such distinct subspaces are orthogonal, i.e., 
every vector of Ha is orthogonal to every vector of Hb if A*a fJ-b- 
■^^We ignore here the possibihty of an external magnetic field. 

^*In the following description, we use the terms observable and operator interchangeably. 
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It is important to develop explicitly the example of a commuting set of 
observables (0^,0^,...) i.e., where each pair {0^,0^) of observables has com- 
mutator zero: 

[0\ = O'O^ - O^O' = (1.6) 
For this case, we have a series of relationships of the form 1.5 

0'\<l>) = /x» (1.7) 
i = 1,2,... 

defining the set of simultaneous eigenvalues and eigenvectors, or joint-eigenvalues 
and joint-eigenvectors. Note that 1.5 may be regarded as a vector representa- 
tion of the set of equations 1.7 where O = {O^ ,0^, . . .) and fx = (/U^, /x^, . . .) are 
seen respectively as ordered sets of operators and numbers. We use the same 
symbol, /i, to denote a joint-eigenvalue, as was used to designate an eigenvalue. 
We emphasize that for a commuting set fjL refers to an ordered set of numbers. 
The correspondence of the joint-eigenvalues to the joint-eigenvectors is similar 
to that between the eigenvalues and eigenvectors discussed above; to each /x^ 
there corresponds a set of joint-eigenvectors forming a subspace of Ti, which we 
refer to as the joint-eigenspace belonging to ju^j. 

We denote by Pa the projection operator onto the eigenspace belonging to /i„ . 
It will be useful for our later discussion to express Pa in terms of an orthonormal 
basis of Ha- If (pk is such a basis, we have 

Pa = Y.\ct>k){4>k\, (1.8) 
k 

which means that the operator which projects onto a subspace is equal to the 
sum of the projections onto the one-dimensional spaces defined by any basis of 
the subspace. If we label the eigenvalues of the observable O as Ha then O is 
represented in terms of the Pa by 

= ^Ma-Pa. (1.9) 

a 

Having developed these quantities, we now discuss the rules of measurement. 
The first rule concerns the possible outcomes of a measurement and it states 
that they are restricted to the eigenvalues {na} for the measurement of a single 
observable^^ O or jomt-eigenvalues {/.t^} for measurement a commuting set of 
observables (0^,0^,...). 

The second rule provides the probability of the measurement result equaling 
one particular eigenvalue or joint-eigenvalue: 

F(0 = Ma) = WPaW (1-10) 

P((0\0^,...) = MJ = WPaW 

'^''A measurement may be classified cither as ideal or non-ideal. Unless specifically stated, 
it will be assumed whenever we refer to a measurement process, what is said will apply just 
as well to either of these situations. 
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where the former refers to the measurement of a single observable and the latter 
to a commuting set. As a consequence of the former, the expectation value for 
the result of a measurement of O is given by 

E{o) = J2 f^aimi') = mi') (1-11) 

a 

where the last equality follows from 1.9. 

The third rule governs the effect of measurement on the system's wave func- 
tion. It is here that the measurement is governed by a different rule depending 
on whether one performs an ideal or non-ideal measurement. An ideal measure- 
ment is defined as one for which the wave function's form after measurement 
is given by the (normalized) projection of ip onto the eigenspace Ha of the 
measurement result Ha 

Pai^/WPai^W (1.12) 

The case of a non-ideal measurement arises when an ideal measurement of a com- 
muting set of obscrvablcs (O^, O^, ...) is regarded as a measurement of an indi- 
vidual member of the set. An ideal measurement of the set of observables leaves 
the wave function as the projection of tf^ onto their jomi-eigenspace. For an 
arbitrary vector ip, the projection onto the eigenspace of an individual member 
of the set does not generally equal the projection onto the set's joint-eigenspace. 
Therefore an ideal measurement of a commuting set cannot be an ideal mea- 
surement of an individual member of the set. We refer to the procedure as a 
non-ideal measurement of the observable. This completes the presentation of 
the general rules of measurement. We discuss below two special cases for which 
these rules reduce to a somewhat simpler form. 

The first special case is that of a non-degenerate observable. The eigenspaces 
of such an observable are one-dimensional and are therefore spanned by a single 
normalized vector called an eigenvector. We label the eigenvector corresponding 
to eigenvalue j^a as |a). The operator which projects onto such a one-dimensional 
space is written as: 

Pa =1 a){a I . (1.13) 

It is easy to sec that the projection given in 1.8 reduces to this form. The form 
of the observable given in 1.9 then reduces to: 

= ^Ma|a)(a|. (1.14) 

a 

The probability of a measurement result being equal to fia is then 

P{0 = ^Xa) = \{a\^P)\^ (1.15) 

in which case the wave function subsequent to the measurement is given by |a). 
These rules are perhaps the most familiar ones since the non-degenerate case is 
usually introduced first in presentations of quantum theory. 
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Finally, there is a case for which the measurement rules are essentially the 
same as these, and this is that of a set of commuting observables which form 
a complete set. The joint-cigcnspaccs of a complete set are onc-dimcnsional. 
The rules governing the probability of the measurement result fj,^ and the ef- 
fect of measurement on the wave function are perfectly analogous to those of 
the non-degenerate observable. This completes our discussion of the rules of 
measurement. 

1.4 Von Neumann's theorem and hidden vari- 
ables 

1.4.1 Introduction 

The quantum formalism contains features which may be considered objectionable' 
by some. These are its subjectivity and indeterminism. The aim of the devel- 
opment of a hidden variables theory^^ is to give a formalism which, while being 
empirically equivalent to the quantum formalism, does not possess these fea- 
tures. In the present section we shall present and discuss one of the earliest 
works to address the hidden variables question, which is the 1932 analysis of 
John von Neumann^^. We shall also review and elaborate on J.S. Bell's analysis 
[8] of this work, in which he made clear its limitations. 

Von Neumann's hidden variables analysis appeared in his now classic book 
Mathematical Foundations of Quantum Mechanics. This book is notable 
both for its exposition of the mathematical structure of quantum theory, and as 
one of the earliest works^^ to systematically address both the hidden variables 
issue and the measurement problem. The quantum formalism presents us with 
two different types of state function evolution: that given by the Schrodinger 
equation and that which occurs during a measurement. The latter evolution 
appears in the formal rule given above in equation 1.12. The measurement 
problem is the problem of the reconciliation of these two types of evolution. 

^''J.S. Bell goes further than this: he refers to quantum mechanics as unprofessional ([18] 
and [40, p. 45]) in its lack of clarity. 

^''See the discussion of Bohmian mechanics above. The great success of this theory is that 
it explains quantum phenomena without such features. The general issue of hidden variables 
is, of course, discussed in several references. Bell's work is the most definitive: see [8, 9, 16] 
in [21]. A recent review was published by N.D. Mermin, who has done much to popularize 
Bell's theorem through articles in Physics Today and through popular lectures. Discussions 
may be also be found in Bohm [28, 32], Belinfante [6], Hughes [69] and Jammer [70]. 

^^The original work is [101]. Discussions of von Neumann's hidden variables analysis may 
be found within [2, 8, 16, 71] 

^^Witliin cliaptcr 4 of tlie present work, wc sliall discuss a 1935 analysis by Erwin 
Sclirodinger [91]. Tliis is the paper in which the 'Schrodinger's cat' paradox first appeared, 
but it is not generally appreciated tliat it contains other results of equal or perhaps greater 
significance, such as Schrodinger's generalization of the Einstein-Podolsky-Rosen paradox. 
We believe that this remarkable paper could have done much to advance the study of the 
foundations of quantum mechanics, had these latter features been more widely appreciated. 
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In his analysis of the hidden variables problem, von Neumann proved a math- 
ematical result now known as von Neumann's theorem and then argued that 
this theorem implied the very strong conclusion that no hidden variables theory 
can provide empirical agreement with quantum mechanics: (preface p. ix,x) " 
. . . such an explanation (by 'hidden parameters') is incompatible with certain 
qualitative fundamental postulates of quantum mechanics." The author further 
states: [101, p. 324] "It should be noted that we need not go further into the 
mechanism of the 'hidden parameters,' since we now know that the established 
results of quantum mechanics can never be rc-dcrivcd with their help." The 
first concrete demonstration'^^ that this claim did not follow was given in 1952 
when David Bohm constructed a viable theory of hidden variables [28]. Then in 
1966, J.S. Bell [8] analyzed von Neumann's argument against hidden variables 
and showed it to be in error. In this section, we begin by discussing an essential 
concept of von Neumann's analysis: the state representation of a hidden vari- 
ables theory. We then present von Neumann's theorem and no-hidden-variables 
argument. Finally, we show where the flaw in this argument lies. 

The analysis of von Neumann is concerned with the description of the state 
of a system, and the question of the incom,pleteness of the quantum formalism's 
description. The notion of the incompleteness of the quantum mechanical de- 
scription was particularly emphasized by Einstein, as we noted in section 1.1.2. 
The famous Einstein-Podolsky-Rosen paper was designed as a proof of such in- 
completeness and the authors concluded this work with the following statement: 
[51] "While we have thus shown that the wave function does not provide a com- 
plete description of the physical reality, we left open the question of whether 
or not such a description exists. We believe, however, that such a theory is 
possible." The hidden variables program, which is an endeavor to supplement 
the state description, is apparently exactly the type of program Einstein was 
advocating. A complete state description might be constructed to remove some 
of the objectionable features of the quantum theoretical description. 

The particular issue which von Neumann's analysis addressed was the follow- 
ing: is it possible to restore determinism to the description of physical systems 
by the introduction of hidden variables into the description of the state of a 
system. The quantum formalism's state representation given by i]j does not 
generally permit deterministic predictions regarding the values of the physical 
quantities, i.e. the observables. Thus results obtained from performance of mea- 
surements on systems with identical state representations i]j may be expected 
to vary. (The statistical quantity called the dispersion is used to describe this 
variation quantitatively.). While it generally does not provide predictions for 
each individual measurement of an observable O, the quantum formalism does 

^"^See Jammer's book [70, page 272] for a discussion of an early work by Grete Hermann 
that addresses the impact of von Neumann's theorem. 
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give a prediction for its average or expectation value^""^: 



E{0) = (V I o I V)- 



(1.17) 



Von Neumann's analysis addresses the question of whether the lack of de- 
terminism in the quantum formalism may be ascribed to the fact that the state 
description as given by ip is incomplete. If this were true, then the complete 
description of state consisting of both ijj and an additional parameter we call 
A — should allow one to make predictions regarding individual measurement re- 
sults for each observable. Note that such predictability can be expressed math- 
ematically by stating that for every ijj and A, there must exist a "value map" 
function, i.e. a mathematical function assigning to each observable its value. 
We represent such a value map function by the expression V^{0). Von Neu- 
mann referred to a hypothetical state described by the parameters ip and A as 
a "dispersion-free state" , since results obtained from measurements on systems 
with identical state representations in and A are expected to be identical and 
therefore exhibit no dispersion. 

Von Neumann's theorem is concerned with the general form taken by a func- 
tion E{0), which assigns to each observable its expectation value. The function 
addressed by the theorem is considered as being of suiEcient generality that the 
expectation function of quantum theory, or of any empirically equivalent theory 
must assume the form von Neumann derived. In the case of quantum theory, 
E{0) should take the form of the quantum expectation formula 1.16. In the 
case of a dispersion-free state, the average over a series of E{0) should return 
the value of each observable. When one analyzes the form of E{0) developed in 
the theorem, it is easy to sec that it cannot be a function of the latter type. Von 
Neumann went on to conclude from this that no theory involving dispersion-free 
states can agree with quantum mechanics. However, since the theorem places an 
unreasonable restriction on the function E{0), this conclusion does not follow. 

1.4.2 Von Neumann's theorem 

The assumptions regarding the function E{0) are as follows. First, the value 
E assigns to the 'identity observable' 1 is equal to unity: 



The identity observable is the projection operator associated with the entire 
system Hilbert space. All vectors are eigenvectors of 1 of eigenvalue 1. The 

^^When generalized to the case of a mixed state, this becomes 



where [/ is a positive operator with the property Tr({7) = 1. Here U is Isnown as the "density 
matrix". See for example, [89, p. 378] 



^(1) = 1 



(1.18) 



E{0) = Tr{UO) 



(1.16) 
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second assumption is that E of any real-linear combination of observables is the 
same linear combination of the values E assigns to each individual observable: 

E{aA + bB + ...) = aE{A) + bE{B) + ... (1.19) 

where (a, 6,...) are real numbers and {A,B,...) are observables. Finally, it is 
assumed that E of any projection operator P must be non-negative: 

E{P) > 0. (1.20) 

For example, in the case of the value map function V^, P must be assigned 
either 1 or 0, since these are its possible values. According to the theorem, 
these premises imply that E{0) must be given by the form 

E{0) = Tr{UO) (1.21) 

where f is a positive operator with the property Tr([/) = 1. 

The demonstration^^ of this conclusion is straightforward. We begin by 
noting that any operator O may be written as a sum of Hermitian operators. 
Define A and B by the relationships A= ^{O + O^) and B = ^{0-0^), where 

is the Hermitian conjugate of O. Then it is easily seen that A and B are 
Hermitian and that 

= A + iB. (1.22) 

We define the function E*{0) by 

E*{0)=E{A)+iE{B), (1.23) 

where E{p) is von Neumann's E{0), and A and B are defined as above. From 
the equations 1.23 and 1.19 we have that E*{0) has the property of complex 
linearity. Note that E*{0) is a generalization of von Neumann's E{0): the 
latter is a reai-linear function on the Hermitian operators, while the former is 
a comp/ea> linear function on all operators. The general form of E* will now be 
investigated for the case of a finite-dimensional operator expressed as a matrix 
in terms of some orthonormal basis. We write the operator O in the form 

= ^|m)(m|0|n)(n|, (1.24) 

where the sums over m, n are finite sums. This form of O is a linear combination 
of the operators \m){n\, so that the complex linearity of E* implies that 

E*{0) = Y^{m\0\n)E*{\m){n\). (1.25) 

m,n 

•^^A proof of the theorem may be found in von Neumann's original work [101]. Albert- 
son presented a simplification of this proof in 1961 [2]. What we present here is a further 
simplification. 
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Now we define the operator U by the relationship Unm = E*{\m){n\) and the 
1.25 becomes 

E*{0) =Y^OmnUnm = Y.{UO)mm = Tr(C/0). (1.26) 

Since von Neumann's E{0) is a special case of E*{0), 1.26 implies that 

E{0) =T:y{UO). (1.27) 

We now show that ?7 is a positive operator. It is a premise of the theorem that 
E{P) > for any projection operator P. Thus we write E{P^) > where P^ 
is a onc-dimcnsional projection operator onto the vector Using the form of 
E found in 1.27, we have^^ 

TriUP^) = {x\U\x)>0. (1.28) 

Since X is an arbitrary vector, it follows that [/ is a positive operator. The 
relation Tr(f/) = 1 is shown as follows: from the first assumption of the theorem 
1.18 together with the form of E given by 1.27 we have Tr([/) = Tr(C/l) = 1. 
This completes the demonstration of von Neumann's theorem. 



1.4.3 Von Neumann's impossibility proof 

We now present von Neumann's argument against the possibility of hidden vari- 
ables. Consider the function E{0) evaluated on the one- dimensional projection 
operators P^. For such projections, we have the relationship 

= Pi (1-29) 

As mentioned above, in the case when E{0) is to correspond to a dispersion 

free state represented by some ip and A. it must map the observables to their 
values. We write V^{0) for the value map function corresponding to the state 

specified by ip and A. Von Neumann noted V^{0) must obey the relation: 

fiV^m = V^ifiO)), (1.30) 

where / is any mathematical function. This is easily seen by noting that the 

quantity f{0) can be measured by measuring O and evaluating / of the result. 
This means that the value of the observable f{0) will be / of the value of O. 

■^■'Thc equality Tr(?7P^) = (x I I x) ™ 1-28 is seen as follows. The expression Tr(UPx) is 
independent of the orthonormal basis c^n in terms of which the matrix representations of U and 
are expressed, so that one may choose an orthonormal basis of which |x) itself is a member. 
Since Fx =| xXxl, and | </>„>= for all |</)„> except |x>, we have TriUP^) = {x\U\x)- 
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Thus, if FJ(0) maps each observable to a value then we must have 1.30. Hence 
l/^(p2) = (y^(P0))2 which together with 1.29 implies 

V^{P^) = {V^{P^)f (1.31) 

This last relationship implies that V^{P^) must be equal to either or 1. 

Recall from the previous section the relation E(P^) = {(j)\U\(j)). If E{0) 
takes the form of a value map function such as V^{Pcf,), then it follows that 
the quantity {(p\U\<p) is equal to either or 1. Consider the way this quantity 
depends on vector \cp). If we vary \^) continuously then {<p\U\(f)) will also vary 
continuously. If the only possible values of ((/)|C/|0) are and 1, it follows that 
this quantity must be a constant, i.e. we must have either = € H, 

or {(j)\U\(p) = 1 e W. If the former holds true, then it must be that U itself 
is zero. However, with this and the form 1.27, we find that E{1) = 0; a result 
that conflicts with the theorem assumption that E{1) = 1 (1.18). Similarly, if 
{(p\U\(l)) = 1 for all ^ e W then it follows that U = 1. This result also conflicts 
with the requirement 1.18, since it leads to E{1) = Tr(l) = n where n is the 
dimensionality of Tl. 

From the result just obtained, one can conclude that any function E{0) 
which satisfies the constraints of von Neumann's theorem (see 1.18, 1.19, and 
1.20) must fail to satisfy the relationship 1.30, and so cannot be a value map 
function on the observables^^. From this result, von Neumann concluded that 
it is impossible for a deterministic hidden variables theory to provide empirical 
agreement with quantum theory: [101, p. 325] 

It is therefore not, as is often assumed, a question of a re-interpretation 
of quantum mechanics — the present system of quantum mechanics 
would have to be objectively false in order that another description 
of the elementary processes than the statistical one be possible. 

1.4.4 Refutation of von Neumann's impossibility proof 

While it is true that the mathematical theorem of von Neumann is a valid one, it 
is not the case that the impossibility of hidden variables follows. The invalidity 
of von Neumann's argument against hidden variables was shown by Bohm's 
development [28] of a successful hidden variables theory (a counter-example to 
von Neumann's proof) and by J.S. Bell's explicit analysis [8] of von Neumann's 
proof. We will now present the latter. 

The no hidden variables demonstration of von Neumann may be regarded 
as consisting of two components: a mathematical theorem and an analysis of 

**It should be noted that the same result may be proven without use of 1.30 since the fact 
that V^{P^) must be either or 1 follows simply from the observation that these are the 
eigenvalues of P^. 
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its implications toward hidden variables. As we have said, the theorem itself 
is correct when regarded as pure mathematics. The flaw lies in the analysis 
connecting this theorem to hidden variables. The conditions prescribed for the 
function E are found in equations 1.18, 1.19, and 1.20. The theorem of von 
Neumann states that from these assumptions follows the conclusion that the 
form of E{0) must be given by 1.27. When one considers an actual physical 
situation, it becomes apparent that the second of the theorem's conditions is 
not at all a reasonable one. As we shall see, the departure of this condition from 
being a reasonable constraint on E{0) is marked by the case of its application 
to non-commuting observables. 

We wish to demonstrate why the assumption 1.19 is an unjustified constraint 
on E. To do so, we first examine a particular case in which such a relationship 
is reasonable, and then contrast this with the case for which it is not. The 
assumption itself calls for the real- linearity of E{0), i.e. that E must satisfy 
E(aA + bB + . . .) = aE(A) + bE{B) + . . . for any observables {A,B, . . .} and real 
numbers {a,b, . . .}. This is in fact, a sensible requirement for the case where 
{A,B,...} are commuting observables. Suppose for example, the observables 
Oi, O27 O3 form a commuting set and that they obey the relationship Oi = 
O2 + O3. We know from the quantum formalism that one may measure these 
observables simultaneously and that the the the measurement result {01,02, 03} 
must be a member of the joint-eigenspectrum of the set. By examining the 
relation 1.7 which defines the joint-eigenspectrum, it is easily seen that any 
member of the joint eigenspectrum of Oi, O2, O3 must satisfy oi =02-1-03. This 
being the case, one might well expect that the function E(0) which in the case 
of a dispersion free state must be a value map (O) on the observables — should 
be required to satisfy E{0\) = E{02) + E{Oz). On the other hand, suppose we 
consider a set {O, P, Q} satisfying O = P + Q, where the observables P and Q 
fail to commute, i.e. [P, Q] ^ 0. It is easy to see that O commutes with neither 
P nor Q. It is therefore impossible to perform a simultaneous measurement of 
any two of these observables. Hence, measurements of these observables require 
three distinct experimental procedures. This being so, there is no justification 
for the requirement that E{0) = E{P) + E{Q) for such cases. 

As an example one may consider the case of a spin i particle. Suppose that 
the components of the spin given by CTx, (Ty and a where 

f^' = ^K+<7^), (1.32) 

are to be examined. The measurement procedure for any given component of the 
spin of a particle is performed by a suitably oriented Stern-Gerlach magnet. For 
example, to measure the .x-componcnt, the magnet must be oriented along the x- 
axis; for the y-component it must be oriented along the j/-axis. A measurement 
of (J is done using a Stern-Gerlach magnet oriented along an axis in yet another 
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direction"^^. The relationship 1.32 cannot be a reasonable demand to place on the 
expectation function E{0) of the observables (Tx,(Ty,a , since these quantities 
are measured using completely distinct procedures. 

Thus von Neumann's hidden variables argument is seen to be an unsound 
one. That it is based on an unjustified assumption is sufficient to show this. It 
should also be noted that the presence of the real-linearity postulate discussed 
above makes von Neumann's entire case against hidden variables into an argu- 
ment of a rather trivial character. Examining the above example involving the 
three spin components of a spin ^ particle, we find that the eigenvalues of these 
observables ±^ do not obey 1.32, i.e. 

4^^(4±i) (1.33) 

Since E{0) by hypothesis must satisfy 1.32, it cannot map the observables to 
their eigenvalues. Hence, with the real-linearity assumption one can almost 
immediately 'disprove' hidden variables. It is therefore apparent that Von Neu- 
mann's case against hidden variables rests essentially upon the arbitrary re- 
quirement that E{0) obey real linearity — an assumption that gives immediate 
disagreement with the simple and natural demand that E agree with quantum 
mechanics in giving the eigenvalues as the results of measurement. 



1.4.5 Summciry and further remcirks 

In our discussion of von Neumann's no hidden variables argument, we found 
that the argument may be regarded as consisting of two components: a theo- 
rem which concerns the general form for an expectation function E{0) on the 
observables, and a proof that the function E{0) so developed cannot be a value 
map fimction. Because the assumption of the rcal-lincarity of E{0) is an im- 
justified one, the work of von Neumann does not imply the general failure of 
hidden variables. Finally, we noted that assuming only the real-linearity of E, 
one may easily arrive at the conclusion that such a function cannot be a map 
to the observables' eigenvalues. Ultimately, the lesson to be learned from von 
Neumann's theorem is simply that there exists no mathematical function from 
observables to their values obeying the requirement of real-linearity. 

Abner Shimony has reported [96] that Albert Einstein was aware of both the 
von Neumann analysis itself and the reason it fails as a hidden variables impos- 
sibility proof. The source of Shimony's report was a personal communication 
with Peter G. Bergmann. Bergmann reported that during a conversation with 
Einstein regarding the von Neumann proof, Einstein opened von Neumann's 
book to the page where the proof is given, and pointed to the linearity as- 
sumption. He then said that there is no reason why this premise should hold 

*^It is not difficult to show that a defined in this way is the spin component along an axis 
which is in the x, y plane and lies at 45° from both the x and y axis. 
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in a state not acknowledged by quantum mechanics, if the observables are not 
simultaneously measurable. Here the "state not acknowledged by quantum me- 
chanics" seems to refer to von Neumann's dispersion-free state, i.e. the state 
specified by f/'jand A. It is almost certain that Erwin Schrodinger would also 
have realized the error in von Neumann's impossibility proof, since in his 1935 
paper [91] he gives a derivation which is equivalent to von Neumann's theorem 
in so far as hidden variables are concerned, yet he does not^® arrive at von 
Neumann's conclusion of the impossibility of hidden variables. We shall discuss 
Schrodinger's derivation in the next section. In view of the scarcity'^'' of early 
responses to von Neumann's proof, it is valuable to have such evidence of Ein- 
stein's and Schrodinger's awareness of the argument and its shortcomings. In 
addition, this affirms the notion that Einstein regarded the problem of finding 
a complete description of quantum phenomena to be of central importance (see 
again the Einstein quotes given in sections 1.1.2 and 1.4.1). 

In our introduction to von Neumann's theorem, we stated that the existence 
of a deterministic hidden variables theory led to the result that for each ^ and 
A, there exists a value map on the observables. We represented such value maps 
by the expression V^(0). If one considers the question of hidden variables more 
deeply, it is clear that the agreement of their predictions with those of quantum 
mechanics requires an additional criterion that beyond the existence of a value 
map for each tp and A: it requires agreement with the statistical predictions of 
the quantum formalism. To make possible the empirical agreement of quantum 
theory, in which only statistical predictions are generally possible with the de- 
terministic description of a hidden variables theory, we regard their descriptions 
of a quantum system in the following way. The quantum mechanical state given 
by V corresponds to a statistical ensemble of the states given by ip and A; the 
members of the ensemble being described by the same ip, but differing in A. 
The variation in measurement results found for a series of quantum mechanical 
systems with identical tp is to be accounted for by the variation in parameter 
A among the ensemble of tp, A states. For precise agreement in this regard, we 
require that for all ^ and O, the following relationship must hold: 



where p{X) is the probability distribution over A. 

We have seen from von Neumann's result and from our simple examination 
of the spin ^ observables <Tx,(Ty, -^{<^x + ""y)) that it is impossible to develop 

*®In fact, if Schrodinger had interpreted his result this way, this — in Ught of his own gener- 
alization of the EPR paradox presented in the same paper— would have allowed him to reach 
a further and very striking conclusion. Wc shall discuss this in chapter 4. 

''^See Max Jammer's book [70, p. 265]. Jammer mentions that not only was there very 
little response to von Neumann's impossibility proof, but the book itself was never given a 
review before 1957, with the exception of two brief works by Bloch and Margenau (see the 
Jammer book for these references). 




(1.34) 
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a linear function mapping the observables to their eigenvalues. We shall find 
also that an impossibility proof may be developed showing that the criterion of 
agreement with the quantum statistics, i.e., the agreement with 1.34, cannot be 
met by functions of the form V^{0). Bell's theorem is, in fact, such a proof. 
We will present the proof of Bell's result along with some further discussion in 
chapter 3. 

1.4.6 Schrodinger's derivation of von Neumann's 'impos- 
sibility proof 

As mentioned above, in his famous "cat paradox" paper [91], Schrodinger pre- 
sented an analysis which, as far as hidden variables arc concerned, was essentially 
equivalent to the von Neumann proof. Schrodinger's study of the problem was 
motivated by the results of his generalization of the Einstein-Podolsky-Rosen 
paradox. While EPR had concluded definite values on the position and mo- 
mentum observables only, Schrodinger was able to show that such values must 
exist for all observables of the state considered by EPR. We will discuss both 
the original Einstein Podolsky Rosen analysis, and Schrodinger's generalization 
thereof in much more detail in chapter 4. To probe the possible relationships 
which might govern the values assigned to the various observables, Schrodinger 
then gave a brief analysis of a system whose Hamiltonian takes the form 

H=p^+a^q^. (1.35) 

We are aware from the well-known solution of the harmonic oscillator problem, 
that this Hamiltonian's eigenvalues are given by the set {a?i, 3a?i, 5a?i, 7a7i, . . .}. 
Consider a mapping V(0) from observables to values. If we require that the 
assignments V makes to the observables H,p,q satisfy 1.35 then we must have 

V{H) = {V{p)f+a\V{q)f, (1.36) 

which implies 

{{V{p)f + a^{V{q)f)/ah = an odd integer. (1.37) 

This latter relationship cannot generally be satisfied by the eigenvalues of q, and 
p — each of which may be any real number — and an arbitrary positive number 
a. 

The connection of this result to the von Neumann argument is immediate. 
In the discussion of section 1.4.3, we noted that the value of the observable 
/(O) will be / of the value of O, so that any value map must satisfy f{V{0)) = 
y(/(0)), as given in equation 1.30. Here / may be any mathematical function. 
It follows from this that 1.36 is equivalent to a relation between the observables 
H, p^, and q^ given by: 

V{H) = V{p'')+a''V{q^). (1.38) 
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With the known eigenvalues of iJ, this leads to 

+ a'^V{q^))/an = an odd integer, (1.39) 

which cannot generally be satisfied by the eigenvalues of g^, and each of 
which may be any positive real number — and an arbitrary positive number a. 
We have here another example leading to a demonstration of von Neumann's 
result that there is no linear value map on the observables (Recall the example 
of the spin component observables — ax,cry and a — -^{<^x + <^y) given above.). 
If we consider the von Neumann function E{0), the real-linearity assumption 
requires it to satisfy 1.38. Therefore, E{0) cannot map the observables to their 
eigenvalues. Schrodinger did not regard this as proof of the impossibility of 
hidden variables, as von Neumann did, but concluded only that the relationships 
such as 1.35 will not necessarily be satisfied by the value assignments made to 
the observables constrained by such a relation. Indeed, if Schrodinger had made 
von Neumann's error of interpretation, this would contradict results he had 
developed previously according to which such hidden variables must exist. Such 
a contradiction would have allowed Schrodinger to reach a further conclusion 
which is quite striking, and which we shall discuss in chapter 4. 
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Chapter 2 

Contextuality 



We have seen in the previous section that the analysis of von Neumann has 
little impact on the question of wlictlu^r a viable hidden variables theory may be 
constructed. However, further mathematical results were developed by Andrew 
M. Gleason [57] in 1957 and by Simon Kochen and E.P. Specker in 1967 [74], 
which were claimed by some^ to imply the impossibility of hidden variables. 
In the words of Kochen and Specker [74, p. 73]: "If a physicist X believes in 

hidden variables the prediction of X contradicts the prediction of quantum 

mechanics" . The Gleason, and Kochen and Specker arguments are in fact, 
stronger than von Neumann's in that they assume linearity only for commuting 
observables. Despite this, a close analysis reveals that the impossibility proofs of 
Gleason and of Kochen and Specker share with von Neumann's proof the neglect 
of the possibility of a hidden variables feature called contextuality [8, 16]. We 
will find that this shortcoming makes these theorems inadequate as proofs of 
the impossibility of hidden variables. 

We begin this chapter with the presentation of Gleason's theorem and the 
theorem of Kochen and Specker. We then discuss a more recent theorem dis- 
covered by Mcrmin^, which is similar to these, and yet admits a much simpler 
proof. This will be followed by a discussion of contextuality and its relevance 
to these analyses. We will make clear in this discussion why the theorems in 
question fail as impossibility proofs. As far as the question of what conclu- 
sions do follow, we show that these theorems' implications can be expressed in 
a simple and concise fashion, which we refer to as "spectral-incompatibility". 
We conclude the chapter with the discussion of an experimental procedure first 
discussed by Albert'^ which provides further insight into contextuality. 

^Scc [74, 78, 83]. 

2 Sec Mermin in [78, 80]. 

3See D. Albert [1]. 
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2.1 Gleason's theorem 



Von Neumann's theorem addressed the question of the form taken by a func- 
tion E{0) of the observables. Gleason's theorem essentially addresses the same 
question'*, the most significant difference being that the linearity assumption is 
relaxed to the extent that it is demanded that E be linear on only commut- 
ing sets of observables. Besides this, Gleason's theorem involves a function E 
on only the projection operators of the system, rather than on all observables. 
Finally, Gleason's theorem contains the assumption that the system's Hilbert 
space is at least three dimensional. As for the conclusion of the theorem, this 
is identical to von Neumann's: E{P) takes the form E{P) = Ti{UP) where U 
is a positive operator and Tr(J7) = 1. 

Let us make the requirement of linearity on the commuting observables some- 
what more explicit. First we note that any set of projection {Pi,P2) • • •} onto 
mutually orthogonal subspaces {Hi, • • •} will form a commuting set. Fur- 
thermore, if P projects onto the direct sum Hi ® 7^2 ® ■ • • of these subspaces, 
then {P, Pi,P2. • • •} will also form a commuting set. It is in the case of this 
latter type of set that the linearity requirement comes into play, since these 
observables obey the relationship 

P = Pi + P2 + . . . . (2.1) 

The condition on the function E is then 

E{P) = E{P,) + E{P2) + . . . . (2.2) 

The formal statement of Gleason's theorem is expressed as follows. For any 

quantum system whose Hilbert space is at least three dimensional, any expec- 
tation function E{P) obeying the conditions 2.2, < E{P) < 1, and E{1) = 1 
must take the form 

E{P) = Tr(f/P), (2.3) 

where Tr(l7) = 1 and 17 is a positive operator. We do not present the proof of 
this result^ here. In the next section, we give the outline the proof of Kochen 
and Specker's theorem. The same impossibility result derived from Gleason's 
theorem follows from this theorem. . 

It is straightforward to demonstrate that the function E{P) considered 
within Gleason's theorem cannot be a value map function on these observables. 

^Thc original form presented by A.M. Gleason referred to a probability measure on the 
subspaces of a Hilbert space, but the equivalence of such a construction with a value map on 
the projection operators is simple and immediate. This may be seen by considering that there 
is a one-to-one correspondence of the subspaces and projections of a Hilbert space and that 
the values taken by the projections are 1 and 0, so that a function mapping projections to 
their eigenvalues is a special case of a probability measure on these operators. 

®See Bell [8]. Bell proves that any function E{P) satisfying the conditions of Gleason's 
theorem cannot map the projection operators to their eigenvalues. 
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To demonstrate this, one may argue in the same fashion as was done by von 
Neumann, since the form developed here for E{P) is the same as that concluded 
by the latter. (See section 1.4.3). We recall that if E{0) is to represent a dis- 
persion free state specified by ijj and A, it must take the form of such a value 
map, and E{0) evidently cannot be the expectation function for such a state. 
It is on this basis that the impossibility of hidden variables has been claimed to 
follow from Gleason's theorem. 

2.2 The theorem of Kochen and Specker 

As with Gleason's theorem, the essential assumption of Kochen and Specker's 
theorem is that the expectation function E{0) must be linear on commuting 
sets of obscrvablcs. It differs from the former only in the set of obscrvablcs 
considered. Gleason's theorem was addressed to the projection operators on 
a Hilbert space of arbitrary dimension N. Kochen and Specker consider the 
squares {sg ^} of the spin components of a spin 1 particle. One may note 
that these observables are formally identical to projection operators on a three- 
dimensional Hilbert space. Thus, the Kochen and Specker observables are a 
subset^ of the "N = 3" case of the Gleason observables. Among the obscrvablcs 
{sg ^} any subset {s^, Sy,s^} corresponding to mutually orthogonal component 
directions x, y, z will be a commuting set. Each such set obeys the relationship 

sl + sl + sl = 2. (2.4) 

For every such subset, Kochen and Specker require that E[sg ^) must obey 

E{sl) + Eisl) + E{sl) = 2. (2.5) 

Kochen and Specker theorem states that there exists no function E{sg ^) on the 
squares of the spin components of a spin 1 particle which maps each observable 
to either or 1 and which satisfies 2.5. 

We now make some comments regarding the nature of this theorem's proof. 
The problem becomes somewhat simpler to discuss when formulated in terms 
of a geometric model. Imagine a sphere of unit radius surrounding the origin 
in IR"*. It is easy to see that each point on this sphere's surface corresponds 
to a direction in space, which implies that each point is associated with one 
observable of the set {s^ ^}. With this, E may be regarded as a function on the 
surface of the unit sphere. Since the eigenvalues of each of these spin observables 
are and 1, it must be that E{0) must take on these values, if it is to assume 
the form of a value map function. Satisfaction of 2.5 requires that for each set 
of mutually orthogonal directions, E must assign to one of them and to the 
other directions 1. To gain some understanding'' of why such an assignment 

^The set of projections on a three-dimensional space is actually a larger class of observables. 
''^We follow here the argument given in Belinfante [6, p. 38] 
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of values must fail, we proceed as follows. To label the points on the sphere, 
we imagine that each point on the sphere to which the number is assigned is 
painted red, and each point assigned 1 is painted blue. We label each direction 
as by the unit vector n. Since one direction of every mutually perpendicular 
set is assigned red, then in total we require that one-third of the sphere is 
painted red. If we consider the components of the spin in opposite directions 
9,(j), and 180° — 6*, 180 + 0, these values are always opposite, i.e., if takes 
the value +1, then S-^ takes the value —1. This implies that the values of 
and .s^gQo_Q 180+0 ^^'^^^ equal. There, we must have that points on the sphere 
lying directly opposite each other, i.e. the "antipodes" , must receive the same 
assignment from E. Suppose that one direction and its antipode are painted 
red. These points form the two poles of a great circle, and all points along 
this circle must then be painted blue, since all such points represent directions 
orthogonal to the directions of our two 'red' points. For every such pair of red 
points on the sphere, there must be many more blue points introduced, and we 
will find this makes it impossible to make one-third of the sphere red, as would 
be necessary to satisfy 2.5. 

Suppose we paint the entire first octant of the sphere red. In terms of the 
coordinates used by geographers, this is similar to the region in the Northern 
hemisphere between degrees and 90 degrees longitude. If the point at the 
north pole is painted red then the great circle at the equator must be blue. 
Suppose that the degree and 90 degree meridians are also painted red. Then 
the octant which is the antipode of the first octant must also be painted red. 
This octant would be within the 'southern hemisphere' between 180 degrees 
and 270 degrees longitude. If we now apply the condition that for every point 
painted red, all points lying on the great circle defined with the point at its 
pole, we find that all remaining points of the sphere must then be colored 
blue, thereby preventing the addition of more red points. This assignment 
fails to meet the criterion that less than one-third of the sphere is blue, so 
there are some sets of mutually perpendicular directions which are all colored 
blue. An example of such a set of directions is provided by the points on 
the sphere's surface lying in the mutually orthogonal directions represented by 
(.5, .5, -.7071), (-.1464, .8535, .5), (.8535, -.1464, .5). Each of these points lies 
in a quadrant to which we have assigned the color blue by the above scheme. 
This assignment of values to the spatial directions must therefore fail to meet 
the criteria demanded of the function E{s'^ ^) in Kochen and Specker's premises: 
that E{sg^^) satisfies 2.5 and maps the observables to their values. 

In their proof, Kochen and Specker show that for a discrete set of 117 differ- 
ent directions in space, it is impossible to give appropriate value assignments to 
the corresponding spin observables*. Kochen and Specker then assert that hid- 

*Sincc their presentation, the proof has been simpUfied by Peres in 1991 [86] whose proof 
is based on examination of 33 such h vectors. We also note that a proof presented by Bell [8] 
may be shown to lead easily to a proof of Kochen and Specker. See Mermin in this connection 
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den variables cannot agree with the predictions of quantum mechanics. Their 
conclusion is that if some physicist 'X', mistakenly decides to accept the va- 
lidity of hidden variables then "the prediction of X (for some measurements) 
contradicts the prediction of quantum mechanics." [74, p 73] The authors cite 
a particular system on which one can perform an experiment they claim reveals 
the failure of the hidden variables prediction. We will demonstrate in the next 
section of this work that what follows from Kochen and Specker's theorem is 
only that a non-contextual hidden variables theory will conflict with quantum 
mechanics, so that the general possibility of hidden variables has not been dis- 
proved. Furthermore, we show that if the type of experiment envisioned by these 
authors is considered in more detail, it does not indicate where hidden variables 
must fail, but instead serves as an illustration indicating that the requirement 
of contextuality is a quite natural one. 

2.2.1 Mermin's theorem 

We would like to discuss here a much more recent theorem discovered by N. 
David Mermin^ in 1990, which is of the same character as those considered 
above. Mermin makes essentially the same assumption regarding the function 
E{0) as did Gleason, and Kochen and Specker: that E{0) must obey all re- 
lationships among the commuting sets of observables. This theorem is more 
straightforward in its proof and simpler in form than those of Gleason and of 
Kochen and Specker. 

The system addressed by Mermin's theorem is that of a pair of spin i par- 
ticles. The observables involved are the x and y components of these spins, and 
six other observables which are defined in terms of these four. We begin with 
the derivation of the expression 

= -1, (2.6) 

since this is actually quite crucial to the theorem. For simplicity, we normalize 
the spin eigenvalues from ±i to ±1. To demonstrate 2.6, we make use of the 
commutation rules for the x and y components of two spin ^ particles. Any 
pair of such observables associated with different particles will commute, so that 
we have [(Ti^\ cri^''] = and [(7^\a'j^^] = 0, for example. Any pair which in- 
volves the same component will also commute, thus we have [ai^^ , tri^^] = and 

(2) (2) 

[cri , (Ji ] = 0. Note that commutation of two observables Oi,02 implies that 
O1O2 = O2O1. Those pairs associated with the same particle but different com- 
ponents do not commute, but anticommute. For two anticommuting observables 
Oi, O2, it follows that their anticommutator [Oi, O2]''' = O1O2 + O2O1 is equal 
to 0. This implies that O1O2 = —O2O1. Using these rules, we may manipulate 
the expression on the left hand side of 2.6 by sequentially interchanging the first 

[80, p. 806]. 

9 See [78] and [80] 
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ai^' with the operator appearing to its right. If we exchange a^P with ay^^ 
and the expression becomes —al^^Oy^^al^^a^^a^^a^^ay^^a'j^K The overall 
minus sign results from its interchange with ay^K At this point, it is straight- 
forward to simplify the expression using that the square of any component of 
the spin has the value 1, i.e. (cr]*^)^ = 1. If we apply this to the expression 
in question, we can easily see that the entire expression reduces to —1, thereby 
verifying 2.6. 

Motivated by the expression 2.6, wc introduce six additional observables 

{A,B,C,X,Y,Z}. (2.7) 

If wc group the observables in the left hand side of 2.6 by pairs, we can rewrite 
this relationship as ABXY = —1 where A, B, X, Y are defined as 



A = 




(2.8) 


B = 






X = 






Y = 







Defining observables C and Z as 

C = AB (2.9) 
Z = XY 

then allows one to write 2.6 as 

CZ = -1. (2.10) 

It is important to note that the equation 2.10 is equivalent to 2.6, given that 
A, B, C, X, Y, Z are defined as given above in 2.9 and 2.8. 

If we examine the first equation in 2.8, we see that the three observables 
involved are a commuting set: [alc'\o-^^] = 0, [ai"^\^] = [ai^\ai^^ al^^] = 

(2) (2^ (1^ (2) 

and [ay ,A] = [ay ,ax ay ] = 0. Examination of the other equations in 2.8 
reveals that the same holds true for these, i.e. the observables in each form a 
commuting set. Repeated application of the commutation rules reveals that the 
sets {C, A, B}, {Z, X, Y}, and {C, Z} are also commuting sets. As was done in 
Gleason's theorem and Kochen and Specker's theorem, we consider the ques- 
tion of a function E{0) on the observables ai"^' , ay^^ , a^^ , a'^^ , A, B, C, X, Y, Z 
which returns for each observable its value. We require that E{0) satisfy all 
constraining relationships on each commuting set of observables. The relation- 
ships in question difi^er from those of Gleason (2.2), and of Kochen and Specker 
(2.5) only in that they involve products of observables, rather than just linear 
combinations. However, this distinction is not a significant one — the essential 
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feature of all of these theorems is simply that they require E{0) to satisfy the 
relationships constraining each commuting set; no relations on non-commuting 
obscrvablcs arc regarded as necessary constraints. The relationships in question 
in the present analysis are the defining equations in 2.8, 2.9 and 2.10. Given 
that E{0) satisfies these, the Mermin theorem implies that this function cannot 
map the observables to their eigenvalues. 

We will now present the proof of the theorem. Since ct^*'' = ±1 Vi,j, the 
eigenvalues of each of the ten observables will be ±1. This is readily seen for 
the obscrvablcs A,B,X,Y defined by 2.8: each of these is the product of two 
commuting observables whose eigenvalues are 1 and —1. With this, it follows 
similarly that C and Z each have eigenvalues of ±1. We therefore require 
that E{0) must assign cither —1 or 1 to each observable. Now let us recall an 
important result which we saw above: the relationship C Z = —1 is equivalent to 
the relationship 2.6, provided that the observables A, B, C, X, Y, Z are defined 
by 2.9, and 2.8. 

Consider the function E{0). Since the assignments made by this function 
have been required to satisfy all the commuting relationships 2.10, 2.9, and 
2.8, it follows that they must also satisfy 2.6. However, if we examine 2.6, 
it follows that no assignment of the values —1 and +1 can possibly be made 
which will satisfy this equation. This follows since each of the spin observables 
appears twice in the left hand side of 2.6, so that any such assignment must 
give a value of 1 to the entire expression, but the right hand side of 2.6 is — 1. 
Therefore, there is no function E{0) which maps each observable of the set 
'Jx'^ , <^\)'\ cri^"* , cTj/^'' ,A,B,C,X,Y,Z to an eigenvalue, if we insist that E{0) on 
every commuting set must obey all the constraint equations. This completes 
the proof of Mermin's theorem. 

2.3 Contextuality and the refutation of the im- 
possibility proofs of Gleason, Kochen and 
Specker, and Mermin 

We have seen that the theorems of Gleason, Kochen and Specker, and Mermin 
all demonstrate the impossibility of value maps on some sets of observables 
such that the constraining relationships on each commuting set of observables 
are obeyed. One might be at first inclined to conclude with Kochen and Specker 
that such results imply the impossibility of a hidden variables theory. However, 
if we consider that there exists a successful theory of hidden variables, namely 
Bohmian mechanics [28] (see section 1.1.2), we see that such a conclusion is in 
error. Moreover, an explicit analysis of Gleason's theorem has been carried out 
by Bell [8, 16] and its inadequacy as an impossibility proof was shown. Bell's 
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argument may easily be adapted^" to provide a similar demonstration regarding 
Kochen and Specker's theorem. The key concept underlying Bell's argument is 
that of contextuality, and we now present a discussion of this notion. 

Essentially, contextuality refers to the dependence of measurement results 
on the detailed experimental arrangement being employed. In discussing this 
notion, we will find that an inspection of the quantum formalism suggests that 
contextuality is a natural feature to expect in a theory explaining the quantum 
phenomena. Furthermore we shall find that the concept is in accord with Niels 
Bohr's remarks regarding the fundamental principles of quantum mechanics. 
According to Bohr, [33] "a closer examination reveals that the procedure of 
measurement has an essential influence on the conditions on which the very 
definition of the physical quantities in question rests." In addition, he stresses 
[90, p. 210] "the impossibility of any sharp distinction between the behavior of 
atomic objects and the interaction with the measuring instruments which serve 
to define the conditions under which the phenomena appear." The concept of 
contextuality represents a concrete manifestation of the quantum theoretical 
aspect to which Bohr refers. We will first explain the concept itself in detail, 
and then focus on its relevance to the theorems of Gleason, Kochen and Specker, 
and Mermin. 

We begin by recalling a particular feature of the quantum formalism. In the 

presentation of this formalism given in chapter one, we discussed the representa- 
tion of the system's state, the rules for the state's time evolution, and the rules 
governing the measurement of an observable. The measurement rules are quite 
crucial, since it is only through measurement that the physical significance of 
the abstract quantum state (given by ip) is made manifest. Among these rules, 
one finds that any commuting set of obscrvables may be measured simultane- 
ously. With a little consideration, one is led to observe that the possibility exists 
for a variety of different experimental proeedures to measure a single observable. 
Consider for example, an observable O which is a member of the commuting 
set {O, Ai, A2, ■ ■ .}. We label this set as C. A simultaneous measurement of the 
set C certainly gives among its results a value for O and thus may be regarded 
as providing a measurement of O. It is possible that O may be a member of 
another commuting set of obscrvables C = {O, £>i, B2, ■ ■ .}, so that a simulta- 
neous measurement of C also provides a measurement of O. Let us suppose 
further that the members of set {Ai} fail to commute with those of {Bi}. It is 
then clear that experiments measuring C and C are quite different, and hence 
must be distinct. A concrete difference appears for example, in the effects of 
such experiments on the system wave function. The measurement rules tell us 
that the wave function subsequent to an ideal measurement of a commuting set 
is prescribed by the equation 1.12, according to which the post- measurement 

^"As we have mentioned, since the Kochen and Specker obscrvables are formally equivalent 
to projections on a three-dimensional Hilbert spa<;e, this theorem is actually a special case of 
Gleason's. Therefore, Bell's argument essentially addresses the Kochen and Specker theorem 
as well as Gleason's. 
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wave function is calculated from the pre-measurement wave function by tak- 
ing the projection of the latter into the joint-eigenspace of that set. Since the 
members of C and C fail to commute, the joint-cigenspaccs of the two arc nec- 
essarily different, and the system wave function will not generally be affected 
in the same way by the two experimental procedures. Apparently the concept 
of 'the mcasiircment of an observable' is ambiguous, since there can be distinct 
experimental procedures for the measurement of a single observable. 

There are, in fact, more subtle distinctions between different procedures for 
measuring the same observable, and these also may be important. Wc will 
see such an example in studying Albert's experiment at the conclusion of this 
chapter. To introduce the experimental procedure of measurement into our 
formal notation, we shall write £{0), £ (O), etc., to represent experimental 
procedures used to measure the observable O. From what we have seen here, 
it is quite natural to expect that a hidden variables theory should allow for 
the possibility that different experimental procedures, e.g., £{0) and £ (O), for 
the measurement of an observable might yield different results on an individual 
system. This is contextuality. 

Examples of observables for which there exist incompatible measurement 
procedures are found among the observables addressed in each of the theo- 
rems of Gleason, Kochen and Specker, and Mermin. Among observables ad- 
dressed by Gleason are the one-dimensional projection operators {P^} on an 
A'^-dimensional Hilbert space Tijv- Consider a one-dimensional projection 
where <j) belongs to two sets of orthonormal vectors given by {<j), V'l, '^i'2i • • •} and 
{(j), xi, X2, ■ ■ •}• Note that the sets {ipi, ip2j ■ ■ ■} and {xi, X2, ■ ■ •} are constrained 
only in that they span Ti^ (the orthogonal complement of the one-dimensional 
space spanned by V')- Given this, there exist examples of such sets for which 
some members of {ipi, ip2, ■ ■ •} are distinct from and not orthogonal to the vec- 
tors in {xi,X2, . . .}■ Since any distinct vectors that are not orthogonal corre- 
spond to projections which fail to commute, the experimental procedures £{P4,) 
measuring {P^, P,p-^ , P^p.^ , . • •} and £ (P,!,) measuring {P,p, P^i . ; • • •} arc in- 
compatible. The argument just given applies also to the Kochen and Specker 
observables (the squares of the spin components of a spin 1 particle), since these 
are formally identical to projections on a three-dimensional Hilbert space. To 
be explicit, the observable is a member of the commuting sets {s^, s^j Sz} and 
{s^, s^, , s^, } where the y and z are oblique relative to the y and z axes. In this 
case, s^i , s^, do not commute with s^, s^. Thus, the experimental procedures to 
measiirc these sets arc incompatible. If wc examine the Mermin observables, we 
find that here also, each is a member of two incompatible commuting sets. For 
example cr^^ belongs to {cr'i^ , cr'^\ A} , and to {a'^\ a'i'\ X} . Here, the observ- 
ables a'^\A do not commute with a^\x, so that the experimental procedures 
£{a^x^) and £ {(t^'^) that entail respectively the measurement of {a^\a^\A\ 
and {a'^\a'i^\x} are incompatible. 
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While it is true that the arguments against hidden variables derived from 
these theorems are superior to von Neumann's, since they require agreement 
only with operator relationships among commuting sets, these arguments nev- 
ertheless possess the following shortcoming. Clearly, the mathematical functions 
considered in each case, E(P),E{sg ^) &nd.E{a-''x\a^\a'i'\a-'^\A,B,C^X^Y^Z) 
do not allow for the possibility that the results of measuring each observable us- 
ing different and possibly incompatible procedures may lead to different results. 
What the theorems demonstrate is that no hidden variables formulation based 
on assignment of a unique value to each observable can possibly agree with quan- 
tum mechanics. But this is a result we might well have expected from the fact 
that the quantum formalism allows the possibility of incompatible experimental 
procedures for the measurement of an observable. For this reason, none of the 
theorems here considered — Gleason's theorem, Kochen and Specker's theorem 
and Mermin's theorem — imply the impossibility of hidden variables, since they 
fail to account for such a fundamental feature of the quantum formalism's rules 
of measurement. 

2.3.0.1 Discussion of a procedure to measure the Kochen and Specker 
observables 

In a discussion of the implications of their theorem, Kochen and Specker men- 
tion a system for which well-known techniques of atomic spectroscopy may be 
used to measure the relevant spin observables. Although these authors mention 
this experiment to support their case against hidden variables, the examination 
of such an experiment actually reinforces the assertion that one should allow 
for contextuality — the very concept that refutes their argument against hidden 
variables. 

Kochen and Specker note^^ that for an atom of orthohelium^^ which is sub- 
jected to an electric field of a certain configuration, the first-order effects of 
this field on the electrons may be accounted for by adding a term of the form 
aS^ + bSy + cSl to the electronic Hamiltonian. Here a, 6, c are distinct con- 
stants, and S^, Sy, Si arc the squares of the components of the total spin of the 
two electrons with respect to the Cartesian axes x, y, z. The Cartesian axes are 
defined by the orientation of the applied external field. For such a system, an 
experiment measuring the energy of the electrons also measures the squares of 
the three spin components. To see this, note that the value of the perturbation 

^-'^Onc can derive the analogous first-order perturbation term arising for a cliargcd particle 
of orbital angular momentum L = 1 in such an electric field using the fact that the joint- 
eigenstates of L-^, Ly, Ly are the eigenstates of the potential energy due to the field. This 
latter result is shown in Kittel [73, p. 427]. 

^^Orthohelium and parahelium are two species of helium which are distinguished by the 
total spin S of the two electrons: for the former we have 5=1, and for the latter S = 0. 
There is a rule of atomic spectroscopy which prohibits atomic transitions for which AS = 1, 
so that no transitions from one form to the other can occur spontaneously. 
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energy will be (a + b), (a + c), or (6 + c) if the joint values of the set Sl,Sy, 
equal respectively {1, 1,0}, {1, 0, 1}, or {0, 1, 1}. 

To understand why the external electric field affects the orthohelium elec- 
trons in this way, consider the ground state of orthohelium^'^. The wave func- 
tion of this state is given by a spatial part (j){r\,r2), depending only on r\,r2 
(the radial coordinates of the electrons), multiplied by the spin part, which 
is a linear combination of the eigenvectors -0-1-1, ■00, "0-1 of , corresponding 
to Sz = +1,0,-1, respectively. Thus, the ground state may be represented 
by any vector in the three-dimensional Hilbert space spanned by the vectors 
0(''i, ^2)V'-i-i) </'(''i) ^2)'i/'o, ^(''1, ''2)V'-i- The external electric field will have the 
effect of "lifting the degeneracy" of the state, i.e. the new Hamiltonian will not 
be degenerate in this space, but its eigenvalues will correspond to three unique 
orthogonal vectors. Suppose that we consider a particular set of Cartesian axes 
X, y, z. We apply an electric field which is of orthorhombic symmetry^^ with re- 
spect to these axes. It can be shown'^^ that the eigenvectors of the Hamiltonian 
due to this field are vi = 1/V^{{ipi — f/'-i), V2 ~ 1/V^{ipi + ^-i) and 113 = ^po- 
We drop the factor 0(ri,r2) for convenience of expression. These vectors are 
also the joint-eigenvectors of the observables S^, Sy, S^, as we can easily show. 
When expressed as a matrix in the {tp+i,ijjo,ip-i} basis, the vectors vi,V2,V3 
take the form 



= (2.11) 




V2 = { \ (2.12) 
V3 = 1 . (2.13) 



^^Using spectroscopic notation, this state would be written as the '2^S' state of orthohelium. 
The '2' refers to the fact that the principle quantum number n of the state equals 2, 'S' denotes 
that the total orbital angular momentum is zero, and the '3' superscript means that it is a 
spin triplet state. Orthohelium has no state of principle quantum number n = 1, since the 
Pauli exclusion principle forbids the 'l^S' state. 

^^Orthorhombic symmetry is defined by the criterion that rotation about either the x ot y 
axis by 180° would bring such a field back to itself. 

'-■■'A straightforward way to sec this is by analogy with a charged particle of orbital angular 
moinciituiri L = I. The effects of an electric or magnetic field on a charged particle of spin 1 are 
analogous to the effects of the same field on a charged particle of orbital angular momentum 
1. To calculate the first-order effects of an electric field of orthorhombic symmetry for such 
a particle, one can examine the spatial dependence of the Lz = 1,0,-1 states •tp-i,4'o,4'+ly 
together with the spatial dependence of the perturbation potential V{r), to show that the 
states l/y^Vi ~ V"-!), l/v^CV"! + V"-!), ^■iid V'o are the eigenstates of such a perturbation. 
A convenient choice of V for this purpose is V = Ax^ + By^ + Cz^. See Kittel in [73, p. 427]. 
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If we then express S^^Sy^Sl as matrices in terms of the same basis, then by 
elementary matrix multiphcation, one can show that v\ corresponds to the joint- 
eigenvahic ^ = {0, 1, 1}, V2 corresponds to /i = {1, 0, 1}, and v?, corresponds to 
ju = {1,1,0}. Thus, the eigenvectors 2,1^3 of the Hamiltonian term H 
which arises from a perturbing electric field (defined with respect to x, y, z) are 
also the joint-eigenvectors of the set {5^, S*^, 5*^ }. This implies that we can 

represent H by the expression aSl + bSy + cSl, where H 's eigenvalues are 
{{b + c),{a + c),{a + b)}. 

All of this leads to the following conclusion regarding the measurement of 
the spin of the orthohelium ground state electrons. Let the system be subjected 
to an electric field with orthorhombic symmetry with respect to a given set 
of Cartesian axes x, y, z. Under these circumstances, the measurement of the 
total Hamiltonian will yield a result equal (to first-order approximation) to 
the (unperturbed) ground state energy plus one of the perturbation corrections 
{(6 + c), (a + c), (a + 6)}. If the measured value of the perturbation energy is 
(a -t- &), (a -t- c), or {b -|- c) then the joint values of the set {S'^, 5^, 5^} are given 
respectively by {1, 1, 0}, {1, 0, 1}, or {0, 1, 1}. 

Suppose we consider two such experiments-'^^, one which measures the set 
{5*1, 5^, Sf} and the other of which measures , S'^, , S'^, , i.e. the squares of 

the components in the x,y , z system. The former experiment involves an 
electric field with orthorhombic symmetry with respect to x, y, z, while the latter 
involves an electric field with a different orientation in space, since it must have 
symmetry with respect to the axes x,y , z . Although both procedures can be 
regarded as measurements of they involve quite different experiments. 

It is quite apparent from this example that it would be unreasonable to 
require that a hidden variables theory must assign a single value to S"^, inde- 
pendent of the experimental procedure. 

2.4 Context uality theorems and spectral-incompatibility 



We saw in our discussion of von Neumann's theorem that its implications toward 

hidden variables amounted to the assertion that there can be no mathematical 
function E{0) that is linear on the observables and which maps them to their 

i^The type of experiments mentioned by Kochen and Specker are actually non-ideal mea- 
surements of sets such as {S^, Sy, S'^}, since the post- measurement wave function of the elec- 
trons is not equal to the projection of the pre-measurement wave function into an eigenspace 
of the set {S^, Sy, S'^} (sec section 1.3). In particular, they envision a spectroscopic analysis 
of the atom, i.e., the observation of photons emitted during transitions between the station- 
ary states of the electrons. The fact that they consider non-ideal measurements of each set 
{S^, Sy, } rather than ideal only serves to widen the range of possibilities for different ex- 
perimental procedures, and so strengthens our point that distinct procedures exist for what 
Kochen and Specker consider to be a 'measurement of 5^'. 
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eigenvalues. This was neither a surprising, nor particularly enlightening result, 
since it follows also from a casual observation of some example of linearly related 
non-commuting observables, as we saw in examining the observables -^{(^x + 
ay), cFx,(Ty of a spin ^ particle. The theorems of Gleason, Kochen and Specker, 
and Mermin imply a somewhat less obvious type of impossibility: there exists 
no function E{0) mapping the observables to their eigenvalues, which obeys 
all relationships constraining commuting observables. What we develop here 
is a somewhat simpler expression of the implication of these theorems. We 
will find that thoy imply the spectral-incompaiibility of the value map function: 
there exists no mathematical function that assigns to each commuting set of 
observables a joint-eigenvalue of that set. 

We begin by recalling the notions of joint-cigcnvoctors and joint-eigenvalues 
of a commuting set of observables (0^,0^,...). For a commuting set, the 
eigenvalue equation 1.5 Oltp) = ^\^p) is replaced by a set of relationships 1.7: 
O^ltp) = /i'lV') * = li2, . . ., one for each member of the commuting set. If a 
given \t{j) satisfies this relationship for all members of the set, it is referred to 
as a joint-eigenvector. The set of numbers (^^, . . .) that allow the equations 
to be satisfied for this vector arc collectively referred to as the joint-eigenvalue 
corresponding to this eigenvector, and the symbol fi = (/Xi,/Lt2, . . .) is used to 
refer to this set. The set of all joint-eigenvalues {n^} is given the name 'joint- 
eigenspectrum ' . 

In general, the members of any given commuting set of observables might not 
be independent, i.e., they may be constrained by mathematical relationships. 
We label the relationships for any given commuting set {O^, O^, . . .} as 

/l(o^o^...) = (2.14) 
/2(o^o^...) = 



The equations 2.1 in Gleason's theorem, 2.4 in Kochen and Specker's theo- 
rem, and 2.8, 2.9 and 2.10 in Mermin's theorem, are just such relations. We 
now demonstrate the following two results. First, that every member of the 
joint-eigenspectrum must satisfy all relationships 2.14. Second, that any set of 
numbers ^i, ^2, • • • satisfying all of these relationships is a joint-eigenvalue. 

To demonstrate the first of these, we suppose that fi = (y^^,^^,...) is a 
joint-eigenvalue of the commuting set {0^,0^,...}, with joint-eigenspace H. 
We then consider the operation of fi{0^,0'^,...) on a vector i/j € Ti, where 
fi{0^,0'^, . . .) = is one of the relationships constraining the commuting set. 
We find 

fi{0\0^ . . .)V = /i(Mi,M2, ...)i^ = 0. (2.15) 

The second equality implies that /i(/ii, /X2, • • •) = 0. Since fi{0^,0'^, . . .) = is 
an arbitrary member of the relationships 2.14 for the commuting set {O^, O^, . . .}, 
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it follows that every joint-eigenvalue fj, of the set must satisfy all such relation- 
ships. 

We now discuss the demonstration of the second point. Suppose that the 
numbers {^i,^2,---} satisfy all of 2.14 for some commuting set {0^,0^,...}. 
We consider the following relation: 

([(o^ - f^ir + [o^ ^^ir + . . .m - iAY + io' f^r + ...]...) v = o. 

(2.16) 

Here, we operate on the vector ^ with a product whose factors each consist of a 
sum of various operators. The product is taken over all joint-eigenvalues {/itj}. 
We represent each joint-eigenvalue /ij by a set (/xj, /uf , . . .). The; validity of 2.16 
is easily seen. Since the joint-eigenspaces of any commuting set are complete, 
the vector ip must lie within such a space. Suppose that tp £ Hi, where Hi is 
the joint-eigenspace corresponding to /lij. Then the zth factor in the product 
of operators in 2.16 must give zero when operating on %l>. Therefore, the entire 
product operating on tp must also give zero. Since ip is an arbitrary vector, it 
follows that 

iiO' - + (O^ - /x?)^ + . . .][{0' - l^lf + iO' - nlf + ...]... = 0. (2.17) 

Note that 2.17 is itself a constraining relationship on the commuting set, so that 
it must be satisfied by the numbers (^1,^2, • ■ •)• This can only be true if these 
numbers form a joint-eigenvalue of {0^,0^, . . .}, and this is the result we were 
to prove. 

Prom this, we can discern a simple way to regard the implications of the 
theorems of Gleason, Kochcn and Spcckcr, and Mcrmin toward the question of 
a value map. The requirement 2.2 Gleason's theorem places on the fimction 
E{P) can be re-stated as the requirement that for each commuting set, E{P) 
must satisfy all the relationships constraining its members. From the above 
argument, it follows that this assumption is equivalent to the constraint that 
E{P) must assign to each commuting set a joint-eigenvalue. The same is also 
true of Kochcn and Spcckcr's assumption that -E(s^ ^) satisfy equation 2.5, and 
the Mermin requirement that E satisfy 2.8, 2.9 and 2.10. 

Thus, all three of these theorems can be regarded as proofs of the impos- 
sibility of a function mapping the observables to their values such that each 
commuting set is assigned a joint-eigenvalue. An appropriate name for such a 
proof would seem to be 'spectral- incompatibility theorem.' 

2.5 Albert's example and contextuality 

In thinking about any given physical phenomenon, it is natural to try to picture 

to oneself the properties of the system being studied. In using the quantum 
formalism to develop such a picture, one may tend to regard the 'observables' 
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of this formalism, i.e., the Hermitian operators (see section 1.3), as representa- 
tive of these properties. However, the central role played by the experimental 
procedure £{0) in the measurement of any given observable O seems to suggest 
that such a view of the operators may be untenable. We describe an experiment 
originally discussed by David Albert [1] that indicates that this is indeed the 
case: the Hermitian operators cannot be regarded as representative of the prop- 
erties of the system^^. Albert considers two laboratory procedures that may be 
used to measure the component of the spin of a spin | particle. Although the 
two procedures are quite similar to one another, they cannot not be regarded 
as identical when considered in light of the hidden variables theory known as 
Bohmian mechanics. This is a particularly striking instance of contextuality, 
and it indicates the inadequacy of the conception that the spin operator (Jz 
represents an intrinsic property of the particle. From Albert's example, one can 
clearly see that the outcome of the az measurement depends not only on the 
parameters of the particle itself, but also on the complete experimental setup. 

The Albert example is concerned with the measurement of spin^^ as per- 
formed using a Stern-Gerlach magnet. The schematic diagram given in figure 
2.1 exhibits the configuration used in both of the measurement procedures to be 
described here. Note that we use a Cartesian coordinate system for which the x- 
axis lies along the horizontal direction with positive x directed toward the right, 
and the z-axis lies along the vertical direction with positive z directed upward. 
The y-axis (not shown) is perpendicular to the plane of the figure, and — since 
we use a right-handed coordinate system — positive y points into this plane. The 
long axis of the Stern-Gerlach magnet system is oriented along the x-axis, as 
shown. The upper and lower magnets of the apparatus are located in the di- 
rections of positive z and negative z. We define the Cartesian system further 
by requiring that x-axis (the line defined by j/ = 0, z = 0) passes through the 
center of the Stern-Gerlach magnet system. In each experiment, the spin ^ 
particle to be measured is incident on the apparatus along the positive x— axis. 
In the region of space the particle occupies before entering the Stern-Gerlach 
apparatus, its wave function is of the form 

V't(r) = (^«(r)(| T) + U)), (2.18) 

where the vectors | "f) and | J,) are the eigenvectors of corresponding to 
eigenvalues -\-\ and — ^, respectively. Here iy9t(r) is a localized wave packet 
moving in the positive x direction toward the magnet. 

We wish to consider two experiments that differ only in the orientation of 
the magnetic field inside the Stern-Gerlach apparatus. In the experiment 1, the 
upper iiiMgiu'l lias a stroiig magnetic north pole toward the region of particle 

^^This idea has been propounded by Daumer, Durr, Goldstein, and Zanghi in [39]. See also 

Bell in [22] 

Ks is usual in discussions of Stcrn-Gcrlach experiments, we consider only those effects 
relating to the interaction of the magnetic field with the magnetic moment of the particle. We 
consider the electric charge of the particle to be zero. 
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Figure 2.1: Geometry of the Stern- Gerlach Experiment 



passage, while the lower has a somewhat weaker magnetic south pole toward 
this region. In the experiment 2, the magnets are such that the gradient of 
the field points in the opposite direction, i.e.. the upper magnet has a strong 
magnetic south pole toward the region of passage, while the lower has a weak 
magnetic north pole towards it. After passing the Stern- Gerlach apparatus, the 
particle will be described by a wave function of one of the following forms: 

^l{r) = -L(<^+(r)|T) + <^r(r)U)) (2.19) 
V,^(r) = -L(<^-(r)|T)+0+(r)U)). 

Here Vt (r) corresponds to experiment 1, and ipf{r) corresponds to experiment 
2. In both cases, the function <A>j^(r) represents a localized wave packet moving 
obliquely upward and <p^ (r) represents a localized wave packet moving obliquely 
downward. To measure a^, one places detectors in the paths of these wave 
packets. Examination of the first equation of 2.19 shows that for experiment 1, 
if the particle is detected in the upper path, the result of our measurement 
is -|-^. If the particle is detected in the lower path, the result is — |. For 
experiment 2, the second equation of 2.19 leads to the conclusion that similar 
detections are associated with results opposite in sign to those of experiment 
1. Thus, for experiment 2, detection in the upper path implies = — ^, while 
detection in the lower implies cr^ = 

We make here a few remarks regarding the symmetry of the system. We 
constrain the form of the wave packet fti^) of 2.18, by demanding that it has 
no dependence on y, and that it exhibits reflection symmetry through the plane 
defined by 2 = 0, i.e., (ft{x, z) = ftix, —z). Moreover, the vertical extent of this 
wave packet is to be the same size as the vertical spacing between the upper 
and lower magnets of the apparatus. As regards the wave packets </>^(r) and 
(t>t{Y) of 2.19, if the magnetic field within the apparatus is such that dB^/dz 
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is constant^^, then for both experiments, these packets move at equal angles 
above and below the horizontal (See figure 2.1). Thus, the particle is described 
both before and after it passes the Stcrn-Gcrlach magnet, by a wave function 
which has reflection symmetry through the plane defined by 2; = 0. 



2.5.0.2 Bohmian Mechanics and Albert's example 

We have mentioned that the hidden variables theory developed by David Bohm 
[28] gives an explanation of quantum phenomena which is empirically equivalent 
to that given by the quantum formalism. Bohmian mechanics allows us to 
regard any given system as a set of particles having well-defined (but distinctly 
non- Newtonian [49]) trajectories. Within Bohmian mechanics, it is the the 
configuration of the system q = {qi,q2,q3, ...) which plays the role of the hidden 
variables parameter A. Thus, the state description in this theory consists of both 
tp and q. Bohmian mechanics does not involve a change in the mathematical 
form of tp- just as in the quantum formalism, ip is a vector in the Hilbert 
space associated with the system, and it evolves with time according to the 
Schrodinger equation: 

ih^ = Hi). (2.20) 
The system configuration q is governed by the equation: 

^ = {n/m)lr^{^). (2.21) 

In the case of a particle with spin, we make use of the spinor inner-product in 
this equation. For example, in the case of a spin 5 particle whose wave function 
is = X-f(i')I T) + X-(i')l i)) the equation 2.21 assumes the form 

£^q , X\ (r) VX-F (r) + X- (r) VX- (r) , 

^ = ^'/"^^'"^^ X;(r)x+(r)+X*(r)X-(r) ^'-^'^ 



i^The term added to the particle's Hamiltonian to a<;count for a magnetic field is gs • B, 
where s is the spin, B is the magnetic field and g is the gyromagnetic ratio. To determine 
the form of this term in the case of a Stern-Gerlach apparatus, we require the configuration 
of the magnetic field. A Stern-Gerlach magnet apparatus has a "long axis" which for the 
example of figure 2.1 lies along the x-axis. Since the component of the magnetic field along 
this axis will vanish except within a small region before and after the apparatus, the effects 
of Bx may be neglected. Furthermore, By and Bz within the apparatus may be regarded as 
being independent of x. The magnetic field in the x, z plane between the magnets lies in the 
z-dircction, i.e., B(x,0,z) = Bz{z)k. Over the region of incidence of the particle, the field 
is such that '—Qf- is constant. See for example, Weidner and Sells [102] for a more detailed 
discussion of the Stern-Gerlach apparatus. The motion of the particle in the (/-direction is 
of no importance to us, and so we do not consider the effects of any Hamiltonian terms 
involving only y dependence. The results we discuss in the present section are those which 
arise from taking account of the magnetic field by adding to the Hamiltonian term of the form 
g<TzBz{z = 0) +s<T^(^(z = 0))2. 
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As we expect from the fact that this theory is in empirical agreement with 
quantum theory, Bohmian mechanics does not generally provide, given ip and 
q, a mapping from the obscrvables to their values. In other words, it does not 
provide a non-contextual value map for each state. As we shall see, the choice of 
experimental procedure plays such a pronounced role in the Bohmian mechanics 
description of Albert's spin measurements, that one cannot possibly regard the 
spin operator as representative of an objective property of the particle. 

We first discuss the Bohmian mechanics description of the Albert experi- 
ments. Since the wave function and its evolution are the same as in quantum 
mechanics, the particle's wave function tp is taken to be exactly as described 
above. As far as the configuration q is concerned, there are two important 
features of the Bohmian evolution equations to be considered: the uniqueness 
of the trajectories and the equivariance of the time evolution. The first feature 
refers to the fact that each initial ijj and q leads to a unique trajectory. Since the 
particle being measured has a fixed initial wave function, its initial conditions 
are defined solely by its initial position. The equivariance of the system's time 
evolution is a more complex property. Suppose that at some time t, the proba- 
bility that the system's configuration is within the region dq about q obeys the 
relationship: 

P(q' e dq) = |V(q)|'rfq. (2.23) 

According to equivariance, this relationship will continue to hold for all later 

times t : t > i. In considering the Bohmian mechanics description of a system, 
we assume that the particle initially obeys 2.23. By equivariance, we then have 
that for all later times the particle will be guided to "follow" the motion of 
the wave function. Thus, after it passes through the Stcrn-Gcrlach apparatus, 
the particle will enter either the upward or downward moving packet. From 
consideration of the uniqueness of the trajectory and the equivariance of the 
time evolution, it follows that the question of which branch of the wave function 
the particle enters depends solely on its initial position. 

If we consider the situation in a little more detail, we find a simple crite- 
rion on the initial position of the particle that determines which branch of the 
wave function it will enter. Recall that the initial 2.18 and final 2.19 wave 
function have no dependence on the y coordinate, and that they exhibit reflec- 
tion symmetry through the z = plane. From this symmetry together with 
the uniqueness of Bohmian trajectories, it follows that the particle cannot cross 
the z = plane. In conjunction with equivariance, this result implies that if 
the particle's initial z coordinate is greater than zero, it must enter the upper 
branch, and if its initial z is less than zero the particle must enter the lower 
branch. 

If we now consider the above described spin measurements, we find a some- 
what curious situation. For any given initial configuration q, the question of 
whether the measurement result is CTz = +i or = — ^ depends on the config- 
uration of the experimental apparatus. Suppose that the particle has an initial 
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2 > 0, so that according to the results just shown, it will enter the upper branch 
of the wave function. If the magnetic field inside the Stern-Gerlach apparatus 
is such that dBz/dz < 0, then the particle's final wave function is given by the 
first equation in 2.19, and its detection in the upper branch then implies that 
(Jz = If, on the other hand, the magnetic field of the Stern-Gerlach magnet 
has the opposite orientation, i.e., dB^/dz > 0, then the second equation in 2.19 
obtains and the detection in the upper branch implies that = — ^. Thus, we 
arrive at the conclusion that the "measurement of tr^" gives a different result 
for two situations that differ only in the experimental configiiration. 

The quantum formalism's rules for the measurement of observables strongly 
suggest that the Hermitian operators represent objective properties of the sys- 
tem. Moreover, such a conception is a common clement of the expositions given 
in quantum mechanics textbooks. On the other hand, the fact that the result of 
the "measurement" of ctz can depend on properties of both system and appara- 
tus contradicts this conception. In general, one must consider the results of the 
"measurement of an observable" to be a joint-product of system and measuring 
apparatus. Recall Niels Bohr's comment that [33] "a closer examination reveals 
that the procedure of measurement has an essential influence on the conditions 
on which the very definition of the physical quantities in question rests." For 
further discussion of the role of Hermitian operators in quantum theory, the 
reader is directed to Daumer, Diirr, Goldstein, and Zanghi in [39]. According to 
these authors: "the basic problem with quantum theory . . . more fundamental 
than the measurement problem and all the rest, is a naive realism about opera- 
tors ... by (this) we refer to various, not entirely sharply defined, ways of taking 
too seriously the notion of operator-as-observable, and in particular to the all 
too casual talk about 'measuring operators' which tends to occur as soon as a 
physicist enters quantum mode." 
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Chapter 3 



The 

Einstein— Podolsky— Rosen 
paradox and nonlocality 



In contrast to contextuality, nonlocality is quite an unexpected and surpris- 
ing feature to meet with in the quantum phenomena. As seen in the previous 
chapter, contextuality is essentially the dependence of the hidden variables pre- 
dictions on the different possible experimental procedures for the measurements 
of the observable. Although the nonlocality of a quantum system is not some- 
thing one would regard as natural, it may be proved by a mathematical analysis 
that this feature is inevitable. The demonstration in question arises from the 
well-known Einstein-Podolsky-Rosen paradox [51], in combination with Bell's 
theorem [7]. There exists, however, a mispcrception among some authors that 
what follows from these analyses is only that local hidden variables theories 
must conflict with quantum mechanics. In fact, a more careful examination 
shows that the conjimction of the EPR paradox with Bell's theorem implies 
that any local theoretical explanation must disagree with quantum mechanics. 
To demonstrate this conclusion, we now review the EPR paradox and Bell's 
theorem. We begin with a discussion of the spin singlet version of the EPR 
argument. This will lead us to the presentation of Bell's theorem and its proof. 
We then discuss the conclusion that follows from the conjunction of the EPR 
paradox and Bell's theorem. 
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3.1 Review of the Einstein-Podolsky- Rosen anal- 
ysis 

3.1.1 Rotational invariance of the spin singlet state and 
perfect correlations between spins 

The well-known work of Einstein, Podolsky, and Rosen, first published in 1935, 
was not designed to address the possibility of nonlocality, as such. The title 
of the paper was "Can Quantum Mechanical Description of Physical Reality 
be Considered Complete?" , and the goal of these authors was essentially the 
opposite of authors such as von Neumann: Einstein, Podolsky, and Rosen wished 
to demonstrate that the addition of hidden variables to the description of state 
is necessary for a complete description of a quantum system. According to these 
authors, the quantum mechanical state description given by tp is incomplete, i.e. 
it cannot account for all the objective properties of the system. This conclusion 
is stated in the paper's closing remark: "While we have thus shown that the 
wave function does not provide a complete description of the physical reality, we 
left open the question of whether such a description exists. We believe, however, 
that such a theory is possible." 

Einstein, Podolsky, and Rosen arrived at this conclusion having shown that 
for the system they considered, each of the particles must have position and 
momentum as simultaneous "elements of reality" . Regarding the completeness 
of a physical theory, the authors state: [51] (emphasis due to EPR) "Whatever 
the meaning assigned to the term com,plete, the following requirement for a 
complete theory seems to be a necessary one: every element of the physical 
reality must have a counterpart in the physical theort/^ . This requirement leads 
them to conclude that the quantum theory is incomplete, since it docs not 
account for the possibility of position and momentum as being simultaneous 
elements of reality. To develop this conclusion for the position and momentum 
of a particle, the authors make use of the following sufficient condition for a 
physical quantity to be considered as an element of reality: "If without in any 
way disturbing a system, we can predict with certainty (i.e. with probability 
equaling unity) the value of a physical quantity, then there exists an element of 
physical reality corresponding to this quantity" . 

What we shall present here^ is a form of the EPR paradox which was de- 
veloped in 1951, by David Bohm^. Bohm's version involves the properties of 
the spin singlet state of a pair of spin i particles. Within his argument, Bohm 
shows that various components of the spin of a pair of particles must be elements 
of reality in the same sense as the position and momentum were for Einstein, 
Podolsky, and Rosen. We begin with a discussion of the formal properties of the 

^Thc Bohm spin singlet version and the original version of the EPR paradox differ essen- 
tially in the states and obscrvablcs with which they are concerned. We shall consider the 
original EPR state more explicitly in chapter 4, section 4.1.1. 

2 [25, p. 611-623]. A recent reprint appears within [103, p. 356-368] 
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spin singlet state, and then proceed with our presentation of Bohm's version of 
the EPR argument. 

Because the spin and its components all commute with the obscrvablcs as- 
sociated with the system's spatial properties, one may analyze a particle with 
spin by separately analyzing the spin observables and the spatial observables. 
The spin observables may be analyzed in terms of a Hilbert space 71^ (which 
is a two-dimensional space in the case of a spin ^ particle) and the spatial ob- 
servables in terms of L2(IR)- The full Hilbert space of the system is then given 
by tensor product Hs ® L(Si)2 of these spaces. Hence we proceed to discuss 
the spin observables only, without explicit reference to the spatial observables 
of the system. We denote each direction in space by its 6 and ^ coordinates in 
spherical polar coordinates, and (since we consider spin ^ particles) the symbol 
a denotes the spin. To represent the eigenvectors of ag^^, corresponding to the 
eigenvalues +^ and — |, we write | t 0,(p), and | J. 9,(1)), respectively. For the 
eigenvectors of az, we write simply | "f) and | J,). Often vectors and observables 
are expressed in terms of the basis formed by the eigenvectors of az ■ The vectors 
It 0,^), and \ I 6,(1)) when expressed in terms of these are 

|T^,0) = cos(^/2) I T) + sin(e/2)e'"^ li) (3.1) 
\ie,(t>) = sm{e/2)e-'* 1 1) - cos(e/2) | i). 

We now discuss the possible states for a system consisting of two spin i 
particles^. For such a system, the states are often classified in terms of the total 
spin S = a^^^ + cr^^^ of the particles, where a^^^ is the spin of particle 1 and cr^^^ 
is the spin of particle 2. The state characterized by S* = 1 is known as the spin 
triplet state, ipsT', it is so named because it consists of a combination of the three 
eigenvectors of Sz — the z-component of the total spin — which correspond to the 
eigenvalues —1,0, 1 . The spin singlet state, in which wc shall be interested, is 
characterized by S* = 0. The name given to the state reflects that it contains 
just one eigenvector of the z-component Sz of the total spin: that corresponding 
to the eigenvalue 0. In fact, as wc shall demonstrate, the spin singlet state is an 
eigenvector of all components of the total spin with an eigenvalue 0. We express^ 
the state in terms of the eigenvectors | t)'^': I i)'^' of o'z^' and | t)'^\ I i)'^^ of 
ai^\ as follows: 

V'«« = IT)('Mi)^'^ - U)^'MT)^'^- (3.2) 

For simplicity, we have suppressed the normalization factor Note that 

each of the two terms consists of a product of an eigenvector of cri^' with an 

(2) 

eigenvector of az such that the corresponding eigenvalues are the negatives of 
each oilu^r. 

^See for example, Messiah [81], and Shankar [94]. 

■^Note that a term such as |a)(-'-^ |6)('^^ represents a tensor product of the vector \a)^^^ of 
the Hilbert space associated with the first particle with the vector \b)^^^ of the Hilbert space 
associated with the second. The formal way of writing such a quantity is as; \a)^^') (S) \b)^^^ ■ 
For simplicity of expression, we shall omit the symbol 'ig)' here. 
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If we invert the relationships 3.1 we may then re- write the spin singlet state 
3.2 in terms of the eigenvectors \ '\ 6,^) and \i 0,^) of the component of spin in 
the 6,^ direction. Inverting 3.1 gives: 



1 1) = cos(^/2) \1;9,^) + sm{e/2)e''f' \ie,cP) 
li) = sin(^/2)e-**|t^,.^) - cosmii^,.^). 



(3.3) 



Now let us consider ro- writing the spin singlet state 3.2 in the following manner. 
We substitute for | and | j.)'^^^ expressions involving | f ^^2- and | J, 
02, </'2)'^-' by making use of 3.3. Doing so gives us a new expression for the spin 
singlet state: 



= 1 [sin(^?2/2)e-*"^^ 1 1 62,^2)'^^^ - cos((?2/2) | i 62^2)'^^'^] (3.4) 
-\l}^'^cos{92/2)\^92,h)^'^ + sin(^2/2)e*<^Mi^2,<^2>('^l 



U^2,02) 



(2) 



+ 



U02A2)^^^. 



cos(^2/2)|T)^'^ + sin(^2/2)e^^Mi)^'^ 
sin(^2/2)e-*'^MT)^'^ - cos(^2/2)|i)(^)" 

Examining the second of these relationships, we sec from 3.1 that xpss reduces^: 

V'«« = IT^2,<^2)('Mi^2,<^2)(') - \ie2,h)^''>U e2,h)^^\ (3.5) 

Note that this form is similar to that given in 3.2. Each of its terms is a product 
of an eigenvector of cr^^^^^ and an eigenvector of cr^^^^^ such that the factors 
making up the product correspond to eigenvalues that are just the opposites of 
each other. We drop the '2' from and ^ giving 



(3.6) 



Suppose that we consider the implications of 3.6 for individual measurements 
of cTg^l of particle 1 and the same spin component of particle 2. In each term 
of this spin singlet form, wc have a product of eigenvectors of the two spin 
components such that the eigenvalues are simply the negatives of one another. 
Therefore, it follows that measurements of ct^^^ and a'^Jj^ always give results 
that sum to zero, i.e. if measurement of a'g'^ gives the result then the 

f2) 1 

measurement of cr^^ always gives =F^- We say that these observables exhibit 

perfect correlation. Note that this holds true for pairs of spin observables ag 

(2) 

and ag i whore 0, (j) refer to an arbitrary direction. 



(1) 



^If we multiply a wave function by any constant factor c, where c ^ the resulting wave 
function represents the same physical state. We multiply Vss by —1 to facilitate comparison 
with 3.2 
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3.1.2 Incompleteness argument 



Consider a situation in which the two particles described by the spin singlet state 
are spatially separated from one another, and spin-component measurements are 
to be carried out on each. Since there exist perfect correlations, it is possible 
to predict with certainty the result of a measurement of ai^^ from the result 
of a previous measurement of Suppose for example, if we measure 

and find the result — \- ^ subsequent measurement of cri^^ must give 
a^x^ = If we assume locality then the measurement of a^'^ cannot in any 
way disturb particle 1, which is spatially separated from particle 2. Using the 
Einstein-Podolsky- Rosen criterion, it follows that cri^^ is an element of reality. 
Similarly, one can predict with certainty the result of a measurement of a'y^ 

("21 

from a previous measurement oi cTy . Again, locality implies that measurement 
of a'^^ does not disturb particle 1, and by the EPR criterion, a'^'' must be an 
element of reality. In total we have shown that both a'^'' and a'y^ are elements 
of reality. On the other hand, from the quantum formalism's description of state 
given by ip, one can deduce, at most, one of two such non-commuting quantities. 
Therefore, we may conclude that this description of state is incomplete. More- 
over, since we have perfect correlations between all components of the spins of 
the two particles, a similar argument may be given for any component of a^^\ 
Therefore, particle I's spin component in an arbitrary direction 9, </> must be an 
element of reality. 

Note that we can interchange the roles of particle 1 and 2 in this argument 
to show the same conclusion for all components of particle 2's spin, as well. 

The EPR paradox thus has the following structure. The analysis begins 
with the observation that the spin singlet state exhibits perfect correlations 
such that for any direction 9, 4>, measurements of (7^^| and a'g']^ always give 
results which sum to zero. Together with the assumption of locality, this leads 
to the conclusion that all components of the spins of both particles must be 
elements of reality. Since the quantum state description given by xp does not 
allow for the simultaneous reality of non-commuting quantities such as cri^' and 
Uy^^, it follows that this description is incomplete. 



3.2 Bell's theorem 

The incompleteness of the quantum mechanical state description concluded by 
EPR implies that one must consider a theoretical description of state consisting 
of ip and some additional parameter, in order to fully account for a system's 

properties. In terms of such a state description, one ought to be able to math- 
ematically represent the definite values concluded by EPR using a function^ 

^ Since we are considering a system of fixed "0, namely that of the spin singlet state, no i/' 
dependence need be included in V 



V^(0) mapping each component of the spin of each particle to a value. Here we 
have denoted the supplemental state parameter as A in accordance with the dis- 
cussion of von Neumann's theorem in chapter 1. In 1965, John S. Bell presented 
a famous theorem which addressed the possibility of just such a function on the 
spin observables. Bell was able to show that this formulation must conflict with 
the statistical predictions of quantum mechanics for various spin measurements. 
We now present Bell's theorem. 

Let us first fix our notation. To denote directions in space, we write unit 
vectors such as a,b,c instead of 9 and 0. Rather than using the form V)^{0), we 

shall write ^(A, a) and B{X, h) to represent functions on the spin components 
of particles 1 and 2, respectively. Since the two particles are each of spin |, we 
should have A = ±\ and B = ±|, however for simplicity we rescale these to 
^ = ±1 and B = ±1. 

3.2.1 Proof of Bell's theorem 

The key feature of the spin singlet version of the EPR paradox was its analysis 

of the perfect correlations arising when the two particles of a spin singlet pair 
are subject to measurements of the same spin component. Thus, it may not be 
surprising that Bell's theorem is concerned with a correlation function, which 
is essentially a measure of the statistical correlation between the results of spin 
component measurements of the two particles. The correlation function is to 
be determined as follows: we set the apparatus measuring particle 1 to probe 
the component in the a direction, and the apparatus measuring 2 is set for the 
b direction. We make a series of measurements of spin singlet pairs using this 
configuration, recording the product c^^V?' of the results on each trial. The 
average of these products over the series of measurements is the value of the 
correlation function. 

In general, we expect the value of the average determined in this way to 
depend on the directions a, h with respect to which the spin components are 
measured. According to the quantum formalism, we may predict the average, 
or expectation value of any observable using the formula E{0) = {iplOip). For 
the series of experiments just described, we take the expectation value of product 
of the appropriate spin component observables, giving: 

PQM(a,S) = (a«af )=-a-6. (3.7) 

In the case of the predetermined values, the average of the product of the two 
spin components (T^^a^^ is obtained by taking an average over A: 

P{a,b)= [ d\p{X)A{\,a)B{\,b), (3.8) 
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where p(A) is the probabihty distribution over A. p(A) is normahzed by: 

dX p(A) = 1. (3.9) 



We will now examine the question of whether the correlations function given by 
3.8 is compatible with the quantum mechanical prediction 3.7 for this function. 

Crucial to the EPR analysis is the fact that there is a perfect correlation 
between the results of the measurement of any component of particle I's spin 
in a given direction with the measurement of the same component of particle 
2's spin, such that the results are of opposite sign. To account for this, the 
correlation function must give 

P(a,a) = -lVa. (3.10) 

It is easy to sec that the quantum correlation function satisfies this condition. 
If the prediction derivable using the predetermined values is to reflect this, we 
must have 

A{X, a) = -B{X, a) Va, A. (3.11) 

At this point, wc have enough information to derive the conclusion of the 
theorem. Using 3.8 together with 3.11 and the fact that [A{X, a)]^ = 1, we write 

P{a,b)-P{a,c) = - j dXp{X)[A{X, a) A{X,b)-A{X, a) A{X,c)] (3.12) 

dXp{X)A{X, d)A{X, b) [1 - A{X, b)A{X, c)] 

Using A,B = ±l,we have that 

\P{a,b)-P{d,c)\< I dXp{X)[l- A{X,b)A{X,c)]; (3.13) 



7' 



then using the normalization 3.9, and 3.11 we have 

\P{a,b)-P{a,c)\<l + P{b,c), (3.14) 

and this relation, which is commonly referred to as "Bell's inequality" , is the 
theorem's conclusion. 

Thus, the general framework of Bell's theorem is as follows. The definite 
values of the various components of the two particles' spins are represented by 
the mathematical functions A{X,a), and B{X,b). The condition 

A{X,a) = -B{X,a)ya,X (3.15) 

(equation 3.11), placed on the functions A{X,d), B{X,b) ensures the agreement 
of these functions with the perfect correlations. Bell's theorem tells us that 
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from these conditions it follows that the theoretical prediction for the correlation 
function, P{a,b), must satisfy the Bell inequality, 3.14. 

Based on the fact that the Bell inequality is not satisfied by the quantum 
mechanical correlation function 3.7 (as we shall see below), some authors^ have 
concluded that Bell's theorem proves the impossibility of hidden variables. As 
we shall see, however, this conclusion does not follow. 

3.3 The EPR paradox, Bell's theorem, and non- 
locality 

Recall our discussion of the EPR paradox. Wc found that for a system described 
by the spin singlet state, each component of the spin of one particle is perfectly 
correlated with the same component of the spin of the other. Such perfect cor- 
relations seem to give the appearance of nonlocality, since the measurement of 
one spin seems capable of immediately influencing the result of a measurement 
of its distant partner. The conclusion of nonlocality can be avoided only if all 
components of the spins of each particle possess definite values. This much 
is developed from the EPR analysis. According to Bell's theorem, these defi- 
nite values lead to the prediction that the correlation function P must satisfy 
the inequality 3.14. Since Bell's theorem assumes nothing beyond what can be 
concluded from the spin singlet EPR paradox, one can deduce from the conjunc- 
tion of EPR with Bell that any local theoretical description which accounts for 
the perfect correlations of the spin singlet state leads to a correlation function 
satisfying Bell's inequality. 

Consider now the quantum mechanical prediction for the correlation function 
3.7. If we examine this function, we find that it does not in general satisfy Bell's 
inequality. Suppose, for example, that we have defined some angular orientation 
such that a, b, c all lie in the a;, y plane (so that 6 = 90°), with a along = 60°, 
b along (j) = 0°, and c along (f) = 120°. With this, wc have PQM{a,b) — i, 
PQM{a,c) = \ and PQM{b,c) = -i, so that \PQM{a,b) -PQM{a,c)\ = 1 
and 1 + PQM{b,c) = ^, which is in violation of 3.14. From what wc have 
found above, the disagreement of the quantum mechanical prediction for this 
correlation function with Bell's inequality implies that quantum, m,echanics m,ust 
disagree with any local theoretical description. In the words of Bell: [20] "It is 
known that with Bohm's example of EPR correlations, involving particles with 
spin, there is an irreducible nonlocality." 

Concerning the claim that Bell's theorem is an 'impossibility proof, the fal- 
sity of this conclusion is already indicated by the success of Bohmian mechanics 
(section 1.1.2). On the other hand. Bell's theorem shows that from the exis- 
tence of definite values for the spins follows a conclusion that is in conflict with 

^This is the conclusion reached by the following authors: [24], [55, p. 172], [103, Wigner p. 
291] 



47 



the quantum mechanical predictions. This prompts the question of just what 
feature Bohmian mechanics possesses which allows it to succeed where the quite 
general looking formulation of hidden variables analyzed by Bell docs not. If we 
examine the functions A(A, a) and B{X, h) analyzed by Bell, we see that they do 
not possess the feature we have just seen is present in the quantum theory itself: 
nonlocality. This follows since the value of A docs not depend on the setting 
h of the apparatus measuring particle 2, nor does B depend on the setting a 
of the apparatus measuring particle 1. Thus, Bell analyzes a local theory of 
hidden variables*. Bohmian mechanics, on the other hand, is nonlocal, and it 
is precisely this that allows it to "escape" disproof by Bell's theorem. Hence, 
Bell's theorem does not constitute a disproof of hidden variables in general, but 
only of local hidden variables. 



"This feature is not "accidental": the hidden variables analyzed by Bell are precisely those 
given by the spin singlet EPR analysis — which itself is based on the locality assumption. 
Moreover, Bell did not regard his analysis as a hidden variables impossibility proof, since 
he was aware that: [7] "... a hidden variable interpretation (Bohmian mechanics) has been 
explicitly constructed" . Instead, Bell viewed his theorem as a proof that the "grossly nonlocal 
structure" inherent in Bohmian mechanics "is characteristic ... of any such (hidden variables) 
theory which reproduces exactly the quantum mechanical predictions" . 
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Chapter 4 



Schrodinger's paradox and 
nonlocality 

4.1 Schrodinger's paradox 

In addition to his seminal role in the development of quantum theory itself, 
Erwin Schrodinger made important contributions [91, 92, 93] to its interpreta- 
tion, both in his development of the paradox of "Schrodinger's cat" and in his 
generalization of the Einstein-Podolsky-Rosen paradox. In this latter analysis, 
Schrodinger demonstrated that the state considered by EPR leads to a much 
more general result than these authors had concluded. According to the incom- 
pleteness argument given by Schrodinger, for any system described by a certain 
class of quantum state^, all observables of both particles must be elements of 
reality. As was the case with EPR, Schrodinger's argument was based on the 
existence of perfect correlations exhibited by the state. In this chapter, we show 
that Schrodinger's result can be developed in a simpler way which allows one to 
determine which observables are perfectly correlated with one another using the 
form of the maximally entangled state in question. Moreover, we derive a wide 
variety of new quantum nonlocality proofs based on Schrodinger's generaliza- 
tion of EPR in conjunction with the theorems discussed in chapter 2, Gleason's 
theorem, Kochen and Specker's theorem, and Mermin's theorem. We show that 
any such "spectral-incompatibility" theorem (see section 2.4) when taken to- 
gether with the Schrodinger paradox provides a proof of qiiantum nonlocality. 
Since the spectral-incompatibility theorems involve the quantum predictions for 
individual measurements, the nonlocality proofs to which they lead are of a 
deterministic, rather than statistical, character. A further noteworthy feature 
of this type of quantum nonlocality is that its experimental confirmation need 
involve only the verification of the perfect correlations, with no further obser- 

'maximally entangled state'. 
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vations required. Before addressing these matters, we discuss the original form 
of the EPR paradox. 

4.1.1 The Einstein— Podolsky— Rosen quantum state 

The Bohm version of the EPR paradox presented in chapter 3 was addressed to 
the spin components of two particles represented by the spin singlet state. While 
this argument was concerned with perfect correlations in the spin components, 
the original form of the EPR paradox involved perfect correlations of a slightly 
different form for the positions and momenta. For a system described by the 
original EPR state, we find that measurements of the positions Xi and X2 of 
the two particles give equal results^ while measurements of the momenta give 
results which sum to zero. 

To develop the perfect correlations in position and momentum, we first recall 
how the spin singlet state leads to the spin correlations. The spin singlet state 
takes the form 3.6: 

^ss = \rO,^)®\iO,^) - \10,^) (g> \rO,^), (4.1) 

where we have suppressed the normalization for simplicity. We have here a 

sum of two terms, each being a product of an eigenfunction of a^^^ with an 
(2) 

eigenfunction of ct^^ such that the corresponding eigenvalues sum to zero. In the 

first term of 4. 1 for example, the factors are eigenvectors corresponding to a'g^ = 

5 and = ~5- With this, it is clear that we will have perfect correlations 

between the results of measurement of a^^^^ and crg^|, i.e. measurements of these 
observables will give results which sum to zero. I3y analogy with this result, 
the perfect correlations in position and momentum emphasized by Einstein, 
Podolsky, and Rosen will follow if the wave function assumes the forms: 

/oo 
dp \(l>-p) ^ \(l>p) (4.2) 
-CSO 

/OO 
dx \ipa:) \ipx)- 
-co 

Here \(l)p) is the eigenvector of momentum operator corresponding to a momen- 
tum of p. \ipx) is the eigenvector of position. We now show that the the wave 

^EPR discuss a form in which the difference between the positions is equal to a constant 
they call d. The case we consider differs from this only in that the points of origin from which 

the positions of the two particles are measured are different, so that the quantity X2 — xi + d 

for EPR becomes X2 — xi for our case. 

•^It is clear that the Einstein-Podolsky-Rosen quantum state 4.3 is not normalizable. Nev- 
ertheless, as we saw in the example of the spin singlet state, it is possible to carry out a similar 
argument for states which can be normalized. The 'maximally entangled states', which are 
the subject of the Schrodinger paradox, contain among them a large class of normalizable 
states, as we shall see. 
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function given by the first equation in 4.2: 

1 f°° 

i^EPR = 1^] dp I'A-p) ® I'^p) (4-3) 

assumes also tlie form of the second equation in 4.2. 

To see this we expand the first factor in the summand of iIjepr in terms of 

tpEPR = J dp(^J da;|v?x)(<Px|?!'-p)^ O ItAp)- (4.4) 
Using (<^x|'?^-p) = {(pplfx), this becomes 

dx\ifix) ig) / dp\(j)p){(j)p\ifix). (4.5) 

-OO J — OO 

Since J'^^dp\(j)p){(t)p\ is a unit operator, it follows that 

/oo 
dx\^x) ^ \y^x) (4.6) 
-C30 

This is the result we had set out to obtain. 



4.1.2 Schrodinger's generalization 
4.1.2.1 Maximal perfect correlations 

Schrodinger's work essentially revealed the full potential of the quantum state 

which Einstein, Podolsky, and Rosen had considered. In his analysis, Schrodinger 
demonstrated that the perfect correlations the EPR state exhibits are not lim- 
ited to those in the positions and momenta. For two particles described by the 
EPR state, every observable of each particle will exhibit perfect correlations 
with an observable of the other. What we present here and in subsequent sec- 
tions is a simpler way to develop such perfect correlations than that given by 
Schrodinger ^ This result may be seen as follows. The EPR state 4.3 is rewritten 
as^ 

/oo 
dpidp2{xi,X2\pi,P2){Pl,P2\'ipEPR) = S{X2 - Xi). (4.7) 
-oo 

We may then use a relationship known as the completeness relationship. Ac- 
cording to the completeness relation, if {0„(a;)} is any basis for the Hilbert 

*Scc [92] 

■''The reader may object that for this state, the two particles lie 'on top of one another', 
i.e., the probability that xi ^ X2 is identically 0. However, in the following sections, we show 
that the same conclusions drawn for the EPR state also follow for a more general class of 
'maximally entangled states', of which many do not restrict the positions in just such a way. 
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space L2, then we have '}Z^=i 4'ni^'i^)4'n{x2) = ^(^^2 — Xi). The EPR state may 
be rewritten using this relation as: 



■4'EPr{xi,X2) = ^ (f)l{xi)<j)n{X2), (4.8) 



where {(/)„(a;)} is an arbitrary basis of ^2- Since the form 4.8 resembles the spin 
singlet form 3.6, one might anticipate that it will lead to the existence of perfect 
correlations between the obscrvablcs of particles 1 and 2. As wc shall sec, this is 
true not only for quantum systems described by 4.8, but also for a more general 
class of states as well. 

Let us consider a Hcrmitian operator A (assumed to possess a discrete spec- 
trum) on the Hilbert space ^2- At this point, we postpone defining an observable 
of the EPR system in terms of A — we simply regard A as an abstract operator. 
Suppose that A can be written as 

oo 

A = J2^^n\K){<Pn\, (4.9) 
n=l 

where |(/>„) is the one-dimensional projection operator associated with the 

vector I (/>„). The eigenvectors and eigenvalues of A are respectively the sets 
{|(/>„)}, and Suppose that another Hermitian operator, called A, is defined 

by the relationship 

A , =A* (4.10) 

where A^ , and A^ j are respectively the matrix elements of A and A in the 
position basis {^Px], and the superscript denotes complex conjugation. Note 
that for any given A^ the 'complex conjugate' operator A defined by 4.10 is 
unique. Writing out A^ gives 

K,x' = {'fx\(Y.^^n\K){K\^Wx') (4-11) 

OO 

= ^l^n{'Px\4>n){(l>nWx')- 



n=l 



Using 4.10, we find 



Kx' = T.I'M'I'r.ri^nM* (4-12) 

oo 
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where is just the Hilbert space vector corresponding^ to the function 4>n{^)- 
From the second equation in 4.12, it follows that A takes the form 

oo 

A = j2^^nmm (4.15) 

n=l 

The eigenvectors and eigenvalues of A are respectively the sets {I'/'J^)}, and 

We now consider the observables l^A and 1 on tlic Hilbert space of the 
EPR state, L2®-^2, where 1 is the identity operator on L2. Put less formally, we 
consider A to be an observable of particle 2, and A as an observable of particle 1, 
just as al^^ represented 'the spin component of particle V in our above discussion 
of the spin singlet state. Examining 4.8, we see that each term is a product of 
an eigenvector of A with an eigenvector of A such that the eigenvalues are 
equal. For these observables, we have that iI^epr is an eigenstate of A — A of 
eigenvalue zero, i.e., {A — AjipEPR = 0. Thus, tpEPR exhibits perfect correlation 
between A and A in that the measurements of A and A give results that are 
equal. 

Recall that the L2 basis {^„(a;)} appearing in 4.8 is an arbitrary L2 basis. 
With this, and the fact that the eigenvalues /x„ chosen for A are arbitrary, it 
follows that the operator A can represent any observable of particle 2. We can 
thus conclude that for any observable A of particle 2, there exists a unique 
observable A (defined by 4.10) of particle 1 which exhibits perfect correlations 
with the former. 

We may interchange the roles of particles 1 and 2 in the above argument, if 
we note that the EPR state is symmetric in xi and X2- This latter is proved as 
follows. Since the Dirac delta function is an even function, we have 

5{x2 — xi) = 5{xi — X2)- (4-16) 

Thus the EPR state assumes the form 

00 

tpEPR = S{X2 - Xi) = 6{xi -X2) = ^ (j)n{xi)4>l^{x2), (4-17) 

n=l 

^More formally, we develop the correspondence of Hilbert space vectors |</>) to functions 
4>{x) by expanding in terms of the position eigenvectors \ipx)- 



\<P)= / dx\>fx){>fx\4,). (4.13) 

— oo 

The function (t>{x) can then be identified with the inner product {ipx\<p)- Then the Hilbert 
space vector corresponding to (p*{x) is just 



!</>*) = / dx\vx){vx\<t>)* (4.14) 
dx\ipx){(l>\(px)- 



f 

J — c 
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where {(pn{x)} is an arbitrary basis of L2, and the second equality is the com- 
pleteness relation. This establishes the desired symmetry. 

Suppose now that we consider the obscrvablcs A (g) 1 and 1 (E) A, i.e., we 
consider A as an observable of particle 1, and A as an observable of particle 2. 
Then from the symmetry of the EPR state, it follows that we can reverse the 
roles of the particles in the above discussion to show that for any observable A 
of particle 1, there exists a unique observable A of particle 2 defined by 4.10, 
which exhibits perfect correlations with the former. 

The perfect correlations in position and momentum originally noted by Ein- 
stein, Podolsky and Rosen are a special case of the perfectly correlated observ- 
ables A and A given here. EPR found that the measurement of the positions xi 
and X2 of the two particles must give results that are equal. The momenta pi 
and p2 showed slightly different perfect correlations in which their measurements 
give results which sum to zero. To assess whether the EPR perfect correlations 
in position and momentum arc consistent with the scheme given above, we de- 
rive using 4.10 the forms of x and p. Since the position observable x is diagonal 
in the position eigenvectors and has real matrix elements, 4.10 implies that 
X = X. As for the momentum, this operator is represented in the position basis 
by the differential operator —ifi-£:. Using 4.10 we see that p is just equal to 
the negative of p itself. Hence the perfect correlations developed in the present 
section, according to which A and A are equal, imply that measurements of the 
positions of the two particles must give equal results, whereas measurements of 
the momenta pi and P2 must give results which sum to zero. This is just what 
EPR had found. 

4.1.2.2 Incompleteness argument 

The existence of such ubiquitous perfect correlations allows one to develop an 
argument demonstrating that all obscrvablcs of particle 1 and all observables 
of particle 2 are "elements of reality", i.e., they possess definite values. This 
argument is similar to the incompleteness argument given above for the spin 
singlet EPR case. We consider the possibility of separate measurements of 
observables A, A being performed respectively on particles 1 and 2 of the EPR 
state. As in the spin singlet EPR analysis, we assume locality, so that the 
properties of each particle must be regarded as independent of those of its 
spatially separated partner. Suppose we perform an experimental procedure 
S{A) which constitutes a measurement of the observable A of particle 1. Since 
A is perfectly correlated with A, the result we find allows us to predict with 
certainty the result of any subsequent experiment £{A) measuring A of particle 
2. For example, if S{A) gives the result A = jia, then we can predict with 
certainty that £{A) will give A = fia- Since we have assumed locality, A miist 
be an element of reality, i.e., there exists some definite value E[A). Furthermore, 
since the same number is predicted no matter what experimental procedure £{A) 
is used in ^'s measurement, E{A) cannot depend on the choice of procedure, 
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i.e., E(A) must be non-contextual. 

We saw in the discussion above that for any observable A of particle 2, 
there exists an observable A of particle 1 with which the former shows perfect 
correlation. Hence, the above argument may be applied to any observable of 
particle 2, and we can therefore conclude that all observables of particle 2 are 
elements of reality, and that the definite value of each is non-contextual. Of 
course, if we consider the set of all observables of particle 2, there are some pairs 
among the set which are non-commuting. Since, from the quantum formalism's 
description of state, one can deduce, at most, one observable of such a pair, we 
may conclude that this description of state is incomplete. Thus, any theoretical 
structure which hopes to capture these definite values must contain a state 
description which extends that of quantum mechanics. 

As we noted above, it is possible to interchange the roles of the two particles 
when deriving the perfect correlations. Thus, to every observable A of particle 
1, there is a unique observable A of particle 2 with which the former is perfectly 
correlated. This being the case, one can construct a similar incompleteness ar- 
gument to show that all observables of particle 1 possess non-contextual definite 
values. 

Finally, note that if we consider the possibility of an experiment measuring 
a commuting set of observables of either particle, the quantum theory predicts 
that the result is one of the joint-eigenvalues of that set. For agreement with 
this prediction, the values E{0) assigned to the observables of this particle must 
map each commuting set to a joint-eigenvalue. 

The Schrodinger paradox is seen to have the following structure. The analy- 
sis begins with the observation that the EPR state 4.8 exhibits maximal perfect 
correlations such that for any observable A of either particle, there is a unique 
observable A of the other with which A is perfectly correlated. If we assume 
locality, then these perfect correlations imply the existence of a non-contextual 
value map on all observables of both particles. These value maps must assign 
to every commuting set a joint-eigenvalue. 

The key to the above incompleteness argument is that one can, by mea- 
surement of an observable of particle 2, predict with certainty the result of any 
measiircmcnt of a particular observable of particle 1. In his presentation of 
the EPR incompleteness argument (which concerns position and momentum) 
Schrodinger uses a colorful analogy to make the situation clear. He imagines 
particles 1 and 2 as being a student and his instructor. The measurement of par- 
ticle 2 corresponds to the instructor consulting a textbook to check the answer 
to an examination question, and the measurement of particle 1 to the response 
the student gives to this question. Since he always gives the correct answer, i.e., 
the same as that in the instructor's textbook, the student must have known the 
answer beforehand. Schrodinger presents the situation as follows: [91] (emphasis 
by original author) 

Let us focus attention on the system labeled with small letters 
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p, q and call it for brevity the "small" system. Then things stand as 
follows. I can direct one of two questions to the small system, either 
that about q or that about p. Before doing so I can, if I choose, 
procure the answer to one of these questions by a measurement on 
the fully separated other system (which we shall regard as auxiliary 
apparatus), or I may take care of this afterwards. My small system, 
like a schoolboy under examination, cannot possibly know whether 
I have done this or for which questions, or whether or for which I 
intend to do it later. From arbitrarily many pretrials I know that the 
pupil will correctly answer the first question I put to him. From that 
it follows that in every case, he knows the answer to both questions. 
. . . No school principal would judge otherwise . . . He would not come 
to think that his, the teacher's, consulting a textbook first suggests 
to the pupil the correct answer, or even, in the cases when the teacher 
chooses to consult it only after ensuing answers from the pupil, that 
the pupil's answer has changed the text of the notebook in the pupil's 
favor. 

4.1.2.3 Generalized form of the EPR state 

Consideration of the structure of the EPR state 4.8 suggests the possibility that 
a more general class of states might also exhibit maximal perfect correlations. 
Before developing this, we first note that the formal way to write 4.8 is such 
that each term consists of a tensor product of a vector of particle I's Hilbert 
space with a vector of particle 2's Hilbert space: 



We shall often use this convenient notation in expressing the maximally en- 
tangled states. The operation of complex conjugation in the first factor of the 
tensor product may be regarded as a special case of a class of operators known 
as "anti-unitary involutions" — operators which we shall denote by C, to suggest 
complex conjugation. We will find that any state of the form 



will exhibit ubiquitous perfect correlations leading to the conclusion of definite 
values on all observables of both subsystems. 

An anti-unitary involution operation represents the generalization of com- 
plex conjugation from scalars to vectors. The term "involution" refers to any 
operator C whose square is equal to the identity operator, i.e. = 1. Anti- 



oo 




(4.18) 



OO 




(4.19) 
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unitarity^ entails two conditions, the first of wliicli is anti- linearity: 



C(ci|V'i> + C2IV2) + ...)= c^Cl^i) + c;C|V2) + . . . , (4.20) 

where {ci} arc constants and {]?/'«)} are vectors. The second condition is the 
anti-linear counterpart of unitarity: 

{ci^m = im* = m) vv', (4.21) 

which tells us that under the operation of C, inner products arc replaced by 
their complex conjugates. Note that this property is sufficient to guarantee 
that if the set {|<?^n)} is a basis, then the vectors {C|^„)} in 4.19 must also form 
a basis. For each anti-unitary involution C . there is a special Hilbcrt space basis 
whose elements are invariant under C. The operation of C on any given vector 
IV') can be easily obtained by expanding the vector in terms of this basis. If 
{y„} is this special basis then we have 

oo oo 

C|V)=CEk^>('P#) = El'^^)('^»l^)*- (4.22) 

i=l i=l 

When one is analyzing any given state of the form 4.19, it is convenient to 
express the state and observables using this special basis. The EPR state is a 
special case of the state 4.19 in which the anti-unitary involution C is such that 
the position basis {|</3a;)} plays this role. 

The state 4.19 shows an invariance similar to that we developed for the EPR 
state: the basis in terms of which the state is expressed, is arbitrary. We 

now develop this result. Note that the expression 4.19 takes the form 



V'C = E ^ E \Xi){Xi\K) P E \Xj){Xj\<i>n) , (4.23) 

if we expand the vectors in terms of an arbitrary basis {xO- -Applying the C 
operation in the first factor and rearranging the expression, we obtain 

oo / oo \ / oo \ 

= E EC|Xi)(<^n|Xi)U ElXiXXil'/'n) (4.24) 
n=l \i = l J J 

oo oo oo 

= ^^^iXj\<f>n){^n\Xi)C\Xi)<»\Xj) 

i^l j^l n^l 

CJO oo / oo \ 

= EE^Xil E \^r.){<f>n\ \Xi)C\Xi) \Xj), 

1=1 j=l \n=l / 



^This term may appear confusing for the following reason. It does not not refer to an 
operator which is "not unitary" , but instead the prefix "anti" refers to anti- linearity. 
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where the first equahty foUows from the anti-hnearity of C . Since the expres- 
sion X^^i \(l)n){4'n\ is the identity operator, the orthonormaHty of the set {xi} 
impHes that 

CXD OO OO 

i'C = EE'^'^C'IXi) ® IX.) = E^IX»> ® \X^)■ (4.25) 

i=l j=\ i=\ 

Thus, the form of the state 4.19 is invariant under any change of basis, and we 
have the desired result. 

Note then the role played by the properties of anti-linear unitarity and anti- 
linearity: from the former followed the result that the the vectors {C|(^„)} form 
a basis if the vectors {| do so, and from the latter followed the invariance 
just shown. 

Consider a Hermitian operator A on which can be written as 

OO 

A = Y,^^n\<t>n){(|>n\■ (4-26) 

n=l 

Note that A's eigenvectors and eigenvalues are given by the sets {!</'«)} and 
respectively. If we define the observable A by the relationship 

A = CAC-^, (4.27) 

then A's eigenvalues are the same as A's, i.e., and its eigenvectors are 

given by {C|^„)}. To see this, note that 

CAC-^C\(j)n) = C^ln\(i>n) = /x„C|.^„), (4.28) 

where the first equality follows from CC~^ = 1 and A|(/>n) = ^J^n4'n■ Since 
= 1, it follows that C = C~^, and we may rewrite 4.27 as^: 

A = CAC. (4.31) 

Note that for any given A, the observable A defined by 4.31 is unique. 

If we identify A as an observable of subsystem 2, and A as an observable of 

subsystem 1, then examination of the state 4.19 shows that these exhibit perfect 
correlations such that their measurements always yield results that arc cqiial. 

*Note that to evaluate A, one can express it using its matrix elements with respect to the 
invariant basis {ipn} of C. If we evaluate the matrix element CACij, we find 

{<Pi\CAC>pj) = ivilCAipj) = {>Pi\A<pj)*, (4.29) 

where the second equality follows from 4.22. Prom 4.29 follows the relationship 

Aij=A*^, (4.30) 

which is a convenient form one may use to evaluate A, as we will see in section 4.1.3. Note 
that 4.30 reduces to 4.10 when the invariant basis {(fin} of C is the position basis Iv'x)- 
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From the invariance of the state, we can then conclude that for any observable 
A of subsystem 2, there is a unique observable A of subsystem 1 defined by 4.31 
which exhibits perfect correlations with A. Since, as can easily be proved^, the 
state 4.19 assumes the form 

oo 

^-0 = 5^ ]</>„) ^CI-^J, (4.33) 

one may develop the same results with the roles of the subsystems reversed, 
i.e., for any observable A of subsystem 1, there exists a unique observable A of 
subsystem 2 which exhibits perfect correlations with A. 

An incompleteness argument similar to that given above can be given, and 
one can show the existence of a value map E{0) on all observables of both 
subsystems. 



4.1.2.4 The general form of a maximally entangled state 

States exhibiting ubiquitous perfect correlations are not limited to those of the 
form 4.19. If we examine any composite system whose subsystems are of the 
same dimensionality^^ and which is represented by a state^^ 

N 

i^ME = ^\i^n}<S)\<pn}, (4-35) 
n=l 

where {IV'n)} is any basis of subsystem 1 and {|0n)} is any basis of subsystem 

2, wc find that each observable of cither subsystem exhibits perfect correlations 
with some observable of the other subsystem. To examine the properties of the 

"The proof of this result follows similar lines as the invariance proof given above. One 
expands the vectors of the state 4.19 in terms of the invariant basis {ifn}- Doing so leads 
to an expression similar to the first equality in 4.25, with ipi and ipj replacing Xi ^^id Xji 
respectively. The delta function is then replaced using the relationship 

Sij = iv^il I0n>{0n| j \tpj}, (4.32) 

and one can then easily develop 4.33 

-'-"The derivations of this section can be carried out for an entangled system whose subsys- 
tems are either finite or infinite dimensional. Infinite sums may be substituted for the finite 
sums written here to develop the same results for the infinite dimensional case. 

^^This is equivalent to the form 

JV 

i^ME =^^Cn\lpn) (S)\4>n), (4.34) 
n=l 

where {\ipn)} is any basis of subsystem 1 and {|</>n)} is any basis of subsystem 2 and the 

|c„p = 1 Vn. 
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state 4.35, we note that it may be rewritten^^ as 

N 

71=1 

where U is the anti-unitary operator defined by ?7|(/)„) = \ipn)- Recall from the 
above discussion that (anti-linear) unitarity and anti-linearity are sufficient to 
guarantee both that {U\(j)n)} is a basis if {|0n)} is, and that the state shows the 
invariance we require. We conclude our presentation of the Schrodinger paradox 
by discussing this general form. 

We consider a Hermitian operator A on Hn which can be written as 

N 

A = J2f^n\ct>n)){ct>n\. (4.38) 

71=1 

We then define the observable A by^^ 

A = UAU-K (4.40) 

For any given A, this defines a unique operator A. Using A\^n) = l^n\(l>n) and 
UU~^ = 1, we have 

UAU-^U\<j)n) = ^inUq)n, (4.41) 

so that the eigenvectors and eigenvalues of A are given by {U(j)n} and 

Identifying A as an observable of subsystem 2 and A as an observable of sub- 
system 1, we can see from the form of 4.37 that these exhibit perfect correlations 
for such a state. As in the EPR case and its generalization, the basis-invariance 
and symmetry of the state imply that for any observable A of either subsystem, 
there is a unique observable A of the other with which A is perfectly correlated. 
An argument similar to that given for the EPR state leads to the existence 
of a non-contextual value map E{0) on all observables of subsystem 1 and all 
observables of subsystem 2. 

At this point, one may address the objection that the EPR state 4.8 makes the positions of 
the two particles coincide. For example, one can consider an anti-unitary operator Ud defined 
by 

poo 

UaW= / dx\y:>^){^P\^^+a), (4.36) 



where d is an arbitrary constant. Then, in the state 4.37, the two particles are separated by 
a distance d. 

^^For comparison with the form 4.10, and 4.30, we note that if we express the operator U 
as U = CU with C an anti-unitary involution, and U a unitary matrix, then 4.40 leads to 

Ai^ = {UAU-^)1^, (4.39) 

where the ij subscript indicates the ijth. matrix element in terms of {Vn}, the basis that is 
invariant under C. 
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4.1.3 The spin singlet state as a maximally entangled state 



It is instructive to make an explicit comparison of the properties given above for 
a general maximally entangled state with those of the spin singlet state. The 
spin singlet state takes the form 

i}ss = \^eA) ®\ieA) - \ie,(i>) ®\]e,4>), (4.42) 

when expressed in terms of the eigenvectors of (T0_<^. We have suppressed the 
normalization for simplicity. Inspection of this form leads one to conclude that 
the state must exhibit correlations such that the results of measurement of 0-g|^ 

and a^^^ will give results which sum to zero. These might be better named 

"perfect anti-correlations" , since the measurement results arc the negatives of 
one another. On examining the form given in 4.34, one can see immediately 
that the spin singlet state 4.42 is a maximally entangled state. 

This being the case, it follows that the spin singlet state must assume the 
form 4.37. To see this, we write the anti-unitary operator U — CU, where U is a 

unitary operator given by [7 ~ ^ ^ ^ ^ and C is an anti-unitary involution 

under which the az eigenvectors are invariant, i.e., C| t) = | T) and C| |) = 1 1). 
Using 1 1), I i) as the basis in the maximally entangled state expression 4.37, we 
obtain 

^ = C/|t)0|T) + t/|i)® li). (4.43) 
This reduces-^^ to the familiar spin singlet form 

^- = 1?) ® U> - li) ® IT), (4.44) 

when we write out the operation of U as a matrix multiplication. 

The perfect correlations between spin components of the two particles can be 
regarded as a special case of the maximally entangled state perfect correlations, 
which hold between any observables A of one subsystem and A of the other, 
where A = UAU~^. To develop this, we recall that the observable A exhibits 
perfect correlations with the observable A such that measurements of A and A 
give results that are equal. In the case of the spin singlet state, we have what 
one might call "perfect anti-correlations", i.e., measurements of a^^^ and 
give results which sum to zero. Thus, for the case of the spin singlet state, we 
expect that 0$^ = —(Je,<i,-, or 

Uae,4,U~'^ = -ae,^, (4.45) 

where U is the operator described above. Using = C, one can sec that the 
left hand side of 4.45 reduces to CUae,(t>U~^C . In deriving 4.45, we begin by 
evaluating the expression UcTe,(j,tJ~^ . 

^*To within an overall minus sign, which can be ignored. 
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Recall that the form of the observable ag_^ is given by 

/ cos(6') e-''^sin(6') \ 
'''"^=[e^^Me) -eos(^?)j- (^-46) 

Note that one can write this as ae^^ — \ ^ ^ ) ' where a = cos{9) and 



b = e ^'^sm{6). A simple calculation involving matrix multiplication shows 
that 

_^ )&-'=(-_: -"I). ("7) 

To complete the calculation, wc must evaluate the expression CAC, where A 

is given by the matrix ^ ) '^^ section 4.1.2.3 an expression 

of the form CAC can bo evaluated in terms of its matrix elements in {fn}j the 
basis that is invariant under C. To do so, wc use the relation 4.29: 

{CAC)ij=A*^. (4.48) 

Using 4.48, it follows that ^ ^a)^~(^ b* a)'^^ comparing 

this relation to the form of (7$^^, one can see that CUa0^^U~^C = —(Je,4>-, and 
we have arrived at 4.45. 

4.1.4 Bell's theorem and the maximally entangled states 

We have seen, in chapter 3, how quantum nonlocality may be proved for one 
particular maximally entangled state, which is the spin singlet state. Here, 
we show how Bell's theorem may be applied in proofs of the nonlocality of a 
larger class of maximally entangled states. A more general form of the proof 
we give here is given by Popescu and Rohrlich [87], who demonstrate that the 
nonlocality of any maximally entangled state follows from Bell's theorem. 
Consider a general maximally entangled state given by 

JV N 

i'ME = J2 l*^") ® 1^") = l*^!) ® l^l) + l'^2) IV'2) + Yl 1"^") ® IV'n)- (4-49) 
n=l n=3 

Let us define Hi and 7^2 as the subspaces of particle 1 and 2's Hilbert spaces 
spanned respectively by 4>i , 4>2 and 'ip\,ip2- We define the sets of observables 
^"^^ {^e^i} ^ follows. The set {Ce^],} is formally identical^^ to the set 
{^e,<j>} on Hi, and all its members give zero when operating on any vector in 
the orthogonal complement of Jii. Similarly, the set {Q ^} is formally identical 

Since any two Hilbert spaces of the same dimension are isomorphic, it is possible to define 
such formally identical observables. 
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to {<Je^^} on 7^2, and its members give zero when operating on any vector in its 
orthogonal complement. We now select a class of states smaller than that given 
by 4.49 by making the following substitutions: the vectors | "f) and | J,) replace 
(pi and (j)2 and the vectors 1 1) and — 1 1) replace ^pi and tp2- Here, 1 1) and 1 1) 
are the eigenvectors of cr^. The state 4.49 then becomes 

N 

V' = I T) U) - U) I T) + ^ (/-n V'n- (4.50) 

n=3 

The sets of observables {Cg^j,} and {Cg^],} are zero on every term of 4.50 with 
the exception of the first two. Since the first two terms are identical to the spin 
singlet state, it follows that ip is an eigenstate of ^g^^ + ^g^^ of eigenvalue zero, 
\/0, (j), and therefore perfect correlations between all such observables. We may 
exploit this situation to derive a nonlocality proof with some further effort, as 
we now show. 

Let P(-'^) and P^^^ be the projections operators which project respectively 
onto Til and 7^2- Since the state 4.50 exhibits maximal perfect correlations, 
one can give an incompleteness argument to show that P^^\P^'^\ and the sets 
{Cf^} and {Cf^} must all possess definite values. However, the existence of such 
values cannot agree with the statistical predictions of quantum mechanics, as 
one may see. We now examine what might be called the "conditional correlation 
function" , which we define as follows. As was done in the presentation of Bell's 
theorem, wc represent the predetermined values of the observables {Cg^],} and 

(2) 

{Q ^} using the mathematical functions A{X,a) and B{X,b). Suppose that we 
consider measurements of the set {Ca^\P^^^} of subsystem 1 and {^p\P^^^} of 
subsystem 2. In every such case, we discard those results for which either P^^^ or 
p(^) (or both) equals zero. We consider the correlation in measurements of ^^^-^ 
and S^^^ in those cases for which P'^^ = P'^' = 1. Under these conditions, the 
quantum mechanical prediction for the correlation function PQM{a,b) is given 

by 

^Qm(4'^4'^) = i^sslC^kf^li^ss) =-&-b. (4.51) 

On the other hand, the prediction derived from the predetermined values A{X, a) 
and P(A, b) can be shown to satisfy Bell's inequality 3.14. Therefore, the statis- 
tics of these values — which themselves follow from the assumption of locality — 
are in conflict with the predictions of quantum mechanics and so we must con- 
clude quantum nonlocality. 
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4.2 Schrodinger nonlocality, and a discussion of 
experimental verification 

4.2.1 Schrodinger nonlocality 

Wc have seen that the Schrodinger paradox gives a more general resiilt than fol- 
lows from either version of the EPR paradox. Because of this greater generality, 
it is possible to construct a nonlocality proof using Schrodinger's paradox in con- 
junction with any one of a wide variety of theorems besides that of Bell. This is 
due mainly to the fact that the Schrodinger paradox predicts definite values for 
such a large class of observables that the theorems required need not address 
more than one particle or subsystem — even value map impossibility proofs which 
are concerned with single particle systems may be sufficient. Consider the case 
of Kochen and Specker's theorem. Suppose that some system is described by a 
maximally entangled state whose subsystems are of dimensionality three: 

3 

^ 0„ O i^n- (4.52) 

n=l 

Among the set of all observables on both Hi and W2 are the squares of the 
various spin components'^ of a spin 1 particle, {,Sg ^}. The state 4.52 exhibits 
perfect correlations between all the observables {sg of one siibsystem, and 

their counterparts {sg of the other. As we saw in the incompleteness argu- 
ment given in section 4.1.2.2, if we assume locality, then the perfect correlations 
imply the existence of a non-contextual value map E{0) on the sets {sg 

and {sg ^}. However, we know from the Kochen and Spcckcr theorem that 
there exists no value map {-E'(sg ^)} such that a joint-eigenvalue is assigned 
to every commuting set. This conclusion contradicts the quantum mechanical 
prediction that the measurements of any commuting set always give one of its 
joint-eigenvalues. Thus, if we demand that the perfect correlations of the state 
4.52 are to be explained through a local theory, we are led to a conclusion that is 
in conflict with quantum mechanics. Therefore the quantum description of any 
system described by a state such as 4.52 must entail nonlocality. Note that the 
development of the contradiction between the quantum predictions and those of 
the definite values is expressed using the observables of one subsystem, namely 
the set {sl ,p}. 

Moreover, if we consider any maximally entangled state of dimensionality N 

^^Of course, such a system need not consist of two spin 1 particles. If it does not, tlien the 
same conclusion as is given here holds for those observables which are formally equivalent to 

the sets {sg ^} and {sg ^}. Since any two Hilbert spaces of the same dimension are isomorphic, 
it follows that such formally equivalent observables must exist. 
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of at least three: 

N 

then we can evidently develop a nonlocality proof for this state by using the 
Kochen and Specker theorem. To see this, we re-write 4.53 as 

N 

= 01 Ck) 'i/'l + '/>2 (8) V'2 + 03 O V'a + X ■'/'n O V'n, (4.54) 

and we define Ti.2 to be the subspace of particle 2's Hilbert space spanned by 
the vectors 'ipi,tp2,''p3, and the operator P as the projection operator of Ti2- Let 
us define the set of observables {Ce,^} such that they are formally identical to 
the squares of the spin components {Ce,^} on Ti.2 and give zero when operating 
on any vector in its orthogonal complement. Then the Kochen and Specker 
theorem tells us that the existence of definite values for the set {P,{Cg^}} must 
conflict with quantum mechanics. To see this, consider those joint-eigenvalues 
of {P, {Ce,<p}} for which P = 1. The corresponding values of the set {Ce,<j>} in 
this case are equal to joint-eigenvalues on the spin observables themselves. Since 
the Kochen and Specker theorem implies the impossibility of an assignment of 
values to the spin components, then the same follows for the observables {Ce,4,} 
given that P = 1. 

If we consider the conjunction of Schrodinger's paradox with either Glca- 
son's theorem or Mermin's theorem, we again find that quantum nonlocality 
follows. Prom Gleason's theorem we have that a value map on the set of all 
projections {P} must conflict with quantum mechanics. One can show that for 
any maximally entangled state 

N 

X^W^^i^), (4.55) 

n=l 

where N is at least three, we have quantum nonlocality. This result also holds 
true for for maximally entangled staters of infinite-dimensionality, in which case 
the sum in 4.55 is replaced by an infinite sum. In the case of Mermin's theorem, 
we can show that for any maximally entangled state whose subsystems are at 
least four dimensional, the quantum predictions must entail nonlocality. 

Thus, the nonlocality of the maximally entangled states may be proved not 
only by the analyses involving Bell's theorem (sec section 4.1.4), but also by 
consideration of any of the theorems of Gleason, Kochen and Specker, or Mer- 
min, taken together with the Schrodinger paradox. We have seen that the latter 
type of argument, which we have called "Schrodinger nonlocality", differs from 
that involving Bell's theorem in several ways. First, the Schrodinger nonlo- 
cality it is of a deterministic, rather than statistical character. This is due 
to the fact that the conflict of the Schrodinger incompleteness with quantum 



65 



mechanics is in terms of the quantum prediction for individual measurements, 
i.e., the prediction that measurements of a commuting set always give one of 
that set's joint-cigcnvalucs. Second, such quantum nonlocality proofs can be 
developed from these theorem's implications regarding the observables of just 
one subsystem. Finally, the Schrodinger nonlocality can be proved for a larger 
class of observables than can the EPR/Bell nonlocality. Note that beyond the 
arguments presented here, there remains the possibility of further instances of 
Schrodinger nonlocality, since any 'spectral incompatibility theorem' (see sec- 
tion 2.4) leads to such a demonstration. If the theorem in question concerns 
a class of observables on an A^-dimensional Hilbert space, then the proof can 
be applied to any maximally entangled states whose subsystem's are at least 
A'^-dimensional. 

4.2.2 Discussion of the experimental tests of EPR/Bell 
and Schrodinger nonlocality 

In chapter three, we reviewed the demonstration that the spin singlet version of 

the EPR paradox in conjunction with Bell's theorem leads to the conclusion of 
quantum nonlocality. That quantum theory entails such an unusual feature as 
nonlocality invited physicists to perform laboratory experiments to test whether 
the quantum predictions for the relevant phenomena arc actually borne out. 
The experimental tests^^ inspired by the discovery of quantum nonlocality have 
focused mainly^^ on a maximally entangled system of two photons rather than 
two spin i particles. The results of these experiments are in agreement with 
the quantum mechanical predictions^^. Since we have described the spin singlet 
version of the EPR/Bell nonlocality in some detail (chapter 3), we address this 
case rather than the two-photon maximally entangled state. 

Let us briefly recall how the EPR argument and Bell's theorem give rise to 
the conclusion of quantum nonlocality. According to the EPR argument, the 
perfect correlations exhibited by the spin singlet state can be explained under 
locality only if there exist definite values for all components of the spins of 
both particles. Bell's theorem shows that any such definite values lead to the 
prediction that the 'correlation function' P{a, b) must satisfy Bell's inequality. 
Thus, if we combine the EPR paradox with Bell's theorem we obtain the argu- 
ment that any local theoretical explanation of the perfect correlations must give 
a prediction for P{a, b) that satisfies the Bell inequality. On the other hand, 

^^See for example, Freedman and Clauser [53], Pry and Thompson [54], and Aspect, et. 
al. [3, 4, 5]. For the two-photon systems studied by these authors, an analysis first given 
by Clauser, Horne, Holt and Shimony [37] plays the role of Bell's theorem: the assumption 
of locality together with the existence of the perfect correlations in the photon polarizations 
leads to the CHHS inequality, which relationship is not generally satisfied by the quantum 
mechanical predictions. 

^*The experiment of Lamehi-Rachti, and Mittig [75] involves a pair of protons described by 
the spin singlet state. These results are in support of the quantum predictions. 

^^For a discussion of the experimental results, see following: [10, 12, 64). 
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the quantum mechanical predictions for P{a, b) violate this inequahty. Thus, 
any local theoretical description of the spin singlet state must conflict with the 
quantum mechanical one. 

We now consider what such a laboratory test of quantum nonlocality would 
entail. The EPR/Bell analysis depends on the correctness of the quantum me- 
chanical predictions of perfect correlations. If these quantum predictions are 
not confirmed, i.e., the experimental test does not reveal perfect correlations, 
one could not interpret the experiment in terms of EPR/Bell. It is only with 
the observation of the perfect correlations that further tests can be performed 
to make an experimental judgment between quantum mechanics and the fam- 
ily of local theories. Such a judgment can be made based on the appropriate 
mcasiircments of the correlation function P, and the comparison of the results 
with the Bell inequality. One must measure the correlation function for just 
those angles a,b,c for which the quantum mechanical prediction violates the 
Bell inequality (see section 3.3). If the results of these tests agree with the Bell 
inequality, then the quantum predictions are refuted, and the experiment may 
be interpreted in terms of a theory consistent with the concept of locality. If 
the results disagree with the Bell inequality, this would confirm the quantum 
mechanical predictions and would support the notion of nonlocality. 

4.2.2.1 Test of Schrodinger nonlocality by perfect correlations only 

If one examines the various proofs of quantum nonlocality, one finds that all 
have a somewhat similar structure. The first part of such a proof consists of an 
EPR-like incompleteness argument, according to which the perfect correlations 
exhibited by some system together with the assumption of locality arc shown 
to imply the existence definite values for certain observables. The second part 
of a nonlocality proof is a demonstration that such definite values must conflict 
with the predictions of quantum mechanics. The two parts can be combined 
to produce a proof of quantum nonlocality. When we come to consider the 
laboratory confirmation of such quantum nonlocality, it is natural to expect 
that the two part structure of its proof will be reflected in the various stages 
of the experiment itself. As we have seen above, this is indeed true for the test 
of the EPR/Bell nonlocality: one must observe the perfect correlations, and 
the violation of the Bell inequality to verify that the spin singlet state reflects 
quantum nonlocality. When we examine the experimental test of what we have 
called 'Schrodinger nonlocality', however, we find that it is possible to perform 
the experiment in such a way that the perfect correlations by them,selves are 
sufficient to verify that the system being studied shows nonlocal effects of this 
sort. We now turn to the question of such an experimental test. 

For the sake of definiteness, let us consider a particular example of Schrodinger 
nonlocality, namely that which arises from the conjunction of Schrodinger's 
paradox with Mermin's theorem (see section 2.2.1). This proof is constructed 
as follows. The Schrodinger paradox incompletness argument implies that the 
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perfect correlations exhibited by the maximally entangled state lead to the ex- 
istence of a non-contextual value map on all observables of both subsystems. 
If the subsystems of the state in question are associated with Hilbert spaces 
that are at least 4 dimensional (see section 4.2.1), then there must exist such 
a map on the Mermin observables (or a set of formally equivalent observables). 
Mermin's theorem, however, contradicts the possibility of such a map. If we 
combine this theorem with the Schrodinger incompleteness argument the result 
is a quantum nonlocality proof. 

For the reader's convenience, we recall Mermin's theorem here. The ob- 
servables addressed by this theorem are the x and y components of two spin 
I particles, and the observables A, B, C, X, Y, Z which are defined in terms of 
these through the relations 

A = (4.56) 

X = ahah 

and 



C = AB (4.57) 
Z = XY. 

Note that the symbols a^"^ and a^^^ are used here^°, rather than a^^^ and cr^^\ 
which wc employed in the presentation of Mermin's theorem given in section 
2.2.1. We will refer to these observables as the 'Mermin observables', and the 
notation Mi i = 1 ... 10 shall refer to an arbitrary member of the set. In the 

discussion of Mermin's theorem given in section 2.2.1, wc showed that the com- 
mutation relationships among the spin components led to the relation 

4"V(^)4")4'^)a(")a(^)a(")af = -1. (4.58) 

From this and the definitions 4.56 and 4.57, it is easy to see C and Z satisfy 

CZ = -1. (4.59) 

Mermin's theorem implies that there exists no function E{0) on the observables 
Mi that satisfies the all relationships constraining the commuting observables. 

On inspection of the first equation in 4.56, we see that the three observables 
involved are a commuting set: [(^x"\'^jf^] = 0, [ai^^ A] = [ai"\ ai"^ af ^] = 
and [cr^''\ A] = [cTy^\(T^^(Ty^^] = 0. Examination of the other equations in 4.56 

^"in the present section, wc consider the entire set of Mermin observables as being associated 
with one subsystem of a maximally entangled state. Use of the notation cr'^', cr'^) suggests 
observables belonging to separate subsystems, and might have led to confusion. 
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shows that the same holds true for these, i.e. the observables in each form a 
commuting set. Repeated apphcation of the commutation rules may be used to 
show that the sets {C,A,B}, {Z,X,Y}, and {C,Z} are also commuting sets. 
For convenience, we list these sets here: 

{B,4'^\ai^^} (4.60) 
{C,A,B} {Z,X,Y} 

{C,Z} 

The relationships which Mermin's theorem requires the function E{0) to satisfy 
are the defining equations in 4.56, 4.57 and 4.59. 

We now recall the Schrodinger incompleteness argument in detail. In this 
argument, one considers the possibility of separate experimental procedures 
being performed to measure observables of the two subsystems of any given 
maximally entangled state. Suppose that one measures A of subsystem 2 and A 
of subsystem 1, through experimental procedures £{A), and £{A). The perfect 
correlation between these observables implies that from the value of A found in 
any procedure £{A), we can predict with certainty the value of A found in any 
procedure £{A). With this and the assumption of locality, we must conclude 
that A possesses a definite value V{A), which cannot depend on its measurement 
context. The symmetry of the system allows one to argue in a similar fashion 
to show that A also possesses a non-contextual value V{A). The invariance and 
symmetry of the maximally entangled state then imply that such an argument 
can be performed to show that all observables of subsystems 1 and 2 must 
possess definite values. 

To confirm the 'Schrodingcr-Mcrmin' nonlocality, one must perform perfect 
correlation tests in such a way that — in light of the above argument — it follows 
that all of the Mermin observables possess non-contextual values. Let us ex- 
amine the following series of experiments. For each of the Mermin observables, 
perform the following series of ^periments. First, we simultaneously peform 
£{Mi) on subsystem 2 and £{Mi) on subsystem 1 (The precise experiments 
required for each Mj are to be defined below.). Second, we simultaneously per- 
form £' (Mi) on subsystem 2 and £{Mi). If the perfect correlations are verified 
in both cases and for all Mermin observables then it follows that all of the Mer- 
min observables of subsystem 2 must have definite values, and that these must 
be noncontextual. The latter follows since the perfect correlations are observed 
under conditions where the context of Mj is varied (i.e., from f (Mj) to £ {Mi)) 
while that of Mj is not. 

In this way, one performs that part of the experiment which corresponds to 
the incompleteness argument of the nonlocality proof. One might suppose that 
what is required next is the performance of a separate series of tests to judge 
between the existence of the definite values implied by the perfect correlations, 
and the quantum mechanical predictions. As we saw above, Bell's inequality 
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provides for the empirical difference between these two approaches in the case 
of EPR/Bell nonlocaUty. In the present case, we have that Mermin's theorem 
provides such a difference: the noncontcxtual vahics must fail to satisfy the 
relationships 4.56, 4.57, and 4.59, while quantum mechanics predicts that such 
relationships are satisfied. To confirm the quantum mechanical predictions and 
the presence of nonlocality in the maximally entangled state in question, it would 
appear that one must check whether or not these relationships are satisfied. 
However, as we shall see, if the measurements involved in the perfect correlations 
test are done in a particular way, then the existence of perfect correlations is 
sufficient in itself to prove such a conclusion. We now demonstrate this. 

Consider the measurement of the commuting set {A, }. These ob- 

servablcs are related by the first equation in 4.56, which we repeat here for 
convenience: 

^ = a(«)af. (4.61) 

Suppose that to measure the set in question, we first measure the set {a^^ , (rlf^}. 
Then the values for ai"^ and ay^^ so obtained are simply multiplied together to 
determine the value of A. One can, of course measure any commuting set 
that obeys a constraining relationship in this way, i.e., any commuting set 
{0,0u02,...} where 

= /(0i,02,...), (4.62) 

can be measured by first performing an experiment to measure {Oi,02, ■ ■ •}, 
and then evaluating / of the resulting values obtained, to obtain the value of 
O. If the set {A, cjy^^} is measured in such a way then it is a priori that 
the relationship 4.61 will be satisfied by the measurement results, since this 
relationship is "built into" the very procedure itself, i.e., in such a procedure, 
we use 4.61 in determining these results. 

Now let us suppose that in all of procedures £{Mi) involved in the perfect 
correlation tests discussed above, the above described method is followed. That 
it is possible to use this method in each case is clear from the fact that every 
commuting set in 4.60 is constrained by one of the relationships of 4.56, 4.57, 
and 4.59. If we follow such a method to measure the perfect correlations, then 
it is a priori that the measurement results will obey the commuting relation- 
ships 4.56, 4.57, and 4.59. Since, as we have mentioned, it is the judgment of 
whether or not these relationships are obeyed which is required to complete our 
laboratory test of nonlocality, it follows that the perfect correlation tests by 
themselves are sufficient for this test. 

If, in fact, the perfect correlation test described above gives a 'positive' 
result, then the conclusion of quantum nonlocality necessarily follows. The 
only local theoretical interpretation of such a result — the existence of definite 
values — is immediately ruled out since (through Mermin's theorem) it follows 
that such values cannot satisfy the commuting relationships, which are a priori 
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obeyed when one performs the commuting set measurements in the fashion just 
discussed. 

Moreover, the Schrodinger nonlocality that follows from either of the other 
theorems studied in chapter 2 (Gleason's, and Kochen and Specker's) can also 
be tested through the confirmation of perfect correlations. This follows since 
every observable among the set addressed by each theorem is a member of 
two 'incompatible commuting sets', where each set obeys a relationship of the 
form O = f{0i,02, ■ ■ •)■ Thus, each observable can be measured by two distinct 
procedures £{Oi) and £ (Oj) for which the commuting relationships are a priori. 

4.3 Schrodinger's paradox and von Neumann's 
no hidden variables argument 

Lastly, wc would like to make a few observations which are of historical interest. 
We now focus on the line of thought Schrodinger followed subsequent to his gen- 
eralization of the Einstein-Podolsky-Rosen paradox. In addition, we consider 
the question of the possible consequences had he repeated the mistake made by 
a well-known contemporary, or had he anticipated any of several mathematical 
theorems that were developed somewhat later. 

Having concluded the existence of definite values on the obscrvablcs, Schrodinger 
considered the question of the type of relationships which might govern these 
values. As we discussed in section 1.4.6, he was able to show that no such values 
can obey the same relationships that constrain the observables themselves. In 
particular, he observed that the relationship 

H=p^+a^q^, (4.63) 

is not generally obeyed by the eigenvalues of the obscrvablcs H,p^,q^, so that 
no value map V{0) can satisfy this equation. From this it foUows^^ immediately 
that there exists no value map which is linear on the observables. Thus we see 
that Schrodinger's argument essentially leads to the same conclusion regarding 
hidden variables as von Neumann's theorem. Schrodinger did not, however, 
consider this as proof of the impossibility of hidden variables. 

Instead, the fact that the definite values of his EPR generalization must 
fail to obey such relationships prompted Schrodinger to consider the possibility 
that no relationship whatsoever serves to constrain them: [91] (emphasis due 
to original author) "Should one now think that because we are so ignorant 
about the relations among the variable-values held ready in one system, that 
none exists, that far-ranging arbitrary combination can occur?" Note, however, 
that if the 'variable-values' obey no constraining relationship, each must be 
an independent parameter of the system. Thus, Schrodinger continues with 

^^To show this, one need only note that the value of any observable /(O) will be / of the 
value of O, where / is any mathematical function (see section 1.4.6). 
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the statement: "That would mean that a system of 'one degree of freedom' 
would need not merely two numbers for adequately describing it, as in classical 
mechanics, but rather many more, perhaps infinitely many." 

Recall the discussion of the hidden variables theory known as Bohmian me- 
chanics presented in section 2.5. There we saw that the state description of a 
system is given in this theory by the wave function ip and the system configura- 
tion q. The mathematical form of tp is the same as in the quantum formalism, 
i.e., V is a vector in the Hilbert space associated with the system. Consider a 
spinless particle constrained to move in 1 dimension. The Bohmian mechanics 
state description would consist of the wave function ip{x) G L2 and position 
a; e IR. Since any L2 function is infinite dimensional, i.e. it is an as- 

signment of numbers to the points x € (—00, 00), the Bohmian mechanics state 
description is infinite-dimensional. Thus, a theory of hidden variables such as 
Bohmian mechanics provides just the type of description that Schrodinger's 
speculations had led him to conclude. 

Let us now suppose that rather than reasoning as he did, Schrodinger instead 
committed the same error as von Neumann. In other words, we consider the 
possible consequences had Schrodinger regarded the failure of the definite values 
to satisfy the same relations as the observables as proof that no such values 
can possibly agree with quantum mechanics. This false result would appear 
to refute the conclusion of Schrodinger's generalization of EPR; it would seem 
to imply that the Schrodinger paradox leads to a conflict with the quantum 
theory. The Schrodinger paradox, like the EPR paradox, assumes only that 
the perfect correlations can be explained in terms of a local theoretical model. 
Hence, if Schrodinger had concluded with von Neumann that hidden variables 
must conflict with quantum mechanics, then he would have been led to deduce 
quantum nonlocality. 

As wc saw above, such a conclusion actually follows from the combination 
of Schrodinger's paradox with any of the spectral incompatibility theorems, for 
example, those of Glcason. Kochen and Spcckcr, and Mcrmin. Since Glcason's 
theorem involves the proof of the trace relation 2.3, it seems reasonable to regard 
it as being the most similar of these theorems to that of von Neumann, which 
features a derivation of this same formula^^. Note that to whatever degree one 
might regard Glcason's theorem as being similar to von Neumann's, one must 
regard Schrodinger as having come to within that same degree of a proof of 
quantum nonlocality. 

4.4 Summary and conclusions 

Our investigation of the hidden variables issue was motivated by several mathe- 
matical results that have been interpreted as proofs either of the incompatibility 

^^Von Neumann's derivation is based on quite different assumptions, as we saw. 
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of hidden variables with the quantum theory, or of the existence of serious hm- 
itations on such theories. We have reviewed the arguments first presented by 
J.S. Bell, according to which not only do these theorems fail to demonstrate 
the impossibility of hidden variables, but the restrictions they place on such 
theories — contextuality and nonlocality — are quite similar to particular features 
of the quantum theory itself. When considering the theorems of Glcason and 
of Kochen and Specker, we found that they imply essentially that within any 
hidden variables theory, the way in which values are assigned each observable O 
must allow for contextuality — an attribute reflecting the quantum formalism's 
rules of measurement. Nonlocality is certainly surprising and unexpected, yet 
it has been proved as a feature intrinsic to quantum mechanics from the com- 
bination of Bell's theorem with the spin singlet version of EPR. We found also 
that from Erwin Schrodinger's extensive generalization of the EPR paradox, 
it follows that every spectral-incompatibility theorem (such as the theorems of 
Gleason, Kochen and Specker, and Mermin) will give a new proof of "nonlocality 
without inequalities." 

We began our exposition by discussing the earliest work on hidden variables, 
John von Neumann's. Von Neumann considered the possibility of a theory whose 
state description would supplement that of the quantum formalism with a pa- 
rameter we called A. He regarded this scheme as a way to introduce determinism 
into the quantum phenomena. Mathematically, such determinism is represented 
by requiring that for each tp and A, i.e., for each state, there exists a function 
assigning to each observable its value. Von Neumann showed that no function 
E{0) on the observables satisfying his assumptions can be such a value map. 
From this, he concluded that empirical agreement between any hidden variables 
theory and the quantum theory is impossible. That this conclusion is unjustified 
follows since one of von Neumann's assumptions — that E{0) be linear on the 
observables — is quite unreasonable. There is no basis for the demand that E{0) 
obey O = X + Y where [X, Y] ^ 0, since each of these observables is measured 
by a distinct procedure. 

The theorems of Gleason , Kochen and Specker , and Mermin seem at first 
to succeed where von Neumann failed, i.e., to prove the impossibility of hidden 
variables. Ncv(Tthek^ss. as Bell has shown [8, 16], these theorems also fail as 
arguments against hidden variables, since they do not account for contextuality. 
This concept is easily illustrated by examination of the quantum formalism's 
rules of measurement. We find that the 'measurement of an observable O' can 
be performed using distinct experimental procedures £{0) and £ (O). That 
£{0) and £ are distinct is especially obvious if these measure the commut- 
ing sets C and C where C and C both contain O, but the members of C fail 
to commute with those of C . It is therefore quite reasonable to expect that 
a hidden variables theory should allow for the possibility that different proce- 
dures for some observable's measurement might yield different results for an 
individual system. That it is necessary to account for the detailed experimental 
arrangement recalls the views of Niels Bohr, who warns us of [90, page 210] 
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"the impossibility of any sharp separation between the behavior of atomic ob- 
jects and the interaction with the measuring instruments which serve to define 
the conditions under which the phenomena appear." 

Examples of just such incompatible commuting sets are found among the ob- 
servables in each of the theorems we addressed in chapter 2: Gleason's, Kochen 
and Specker's, and Mermin's. For example, in the theorem of Kochen and 
Specker, the commuting sets are simply of the form {s^, Sy, s^}, i.e., the squares 
of the spin components of a spin 1 particle taken with respect to some Cartesian 
axis system x, y, z. Here one can see that a given observable belongs to both 
{s^,Sy,s^}, and {s^, s^, , s^, }, where the y and z' axes are oblique relative to 
the y, and z axes. Since the theorems of Gleason, Kochen and Specker, and Mer- 
min consider a function E{0) which assigns a single value to each observable, 
they cannot account for the possibility of incompatible measurement procedures 
which the quantum formalism's rules of measurement allows us. Clearly, the ap- 
proach taken by those theorems falls far short of addressing the hidden variables 
issue properly. Thus, we come to concur with J.S. Bell's assessment that [16] 
"What is proved by impossibility proofs is lack of imagination." 

To address the general question of hidden variables, one must allow for 
this important feature of contextuality. The theorems of Gleason, Kochen and 
Specker, and Mermin may be seen as explicit proofs that this simple and natural 
feature is necessary in any hidden variables theory. In particular, any attempt 
to construct a value map that neglects this feature will fail in that it cannot 
satisfy the requirement that the relationships constraining the commuting sets 
must be obeyed. We have seen that there is a simpler way to express this resiilt: 
there exists no value map on the observables mapping every commuting set to 
one of its joint-eigenvalues. 

We were able to gain some measure of additional insight into contextuality in 
examining Albert's example. As one can see from the experiments considered by 
Albert, the Hermitian operators cannot be considered as representing properties 
intrinsic to the quantum system itself. Instead, the results of the "measurement 
of a quantum observable" must be considered as the joint-product of system 
and measuring apparatus. 

When we come to consider Bell's theorem, we must do so in the context of 
the spin singlet version of the Einstein-Podolsky-Rosen paradox. This argu- 
ment essentially shows that locality necessarily leads to the existnce of definite 
values for all components of the spin of both particles of the spin singlet state. 
Since the quantum mechanical description of the state does not account for 
such values, EPR conclude that this description is incomplete. Bell's theorem 
essentially continues where the spin singlet EPR analysis concluded. The fixed 
values for the spin components of the two particles are represented by functions 
A{X,a), B{X,b) where a, b are unit vectors in the directions of the axis of the 
spin component for particle 1 and 2 respectively. In his analysis. Bell consid- 
ers the statistical correlation between spin component measuring experiments 
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carried out on the two particles. According to Bell's theorem, the theoretical 
prediction for this correlation derivable from the variables A{X,a) and B(X,b) 
must satisfy 'Bell's inequality'. The prediction given by quantum mechanics 
does not generally agree with this inequality. Bell's theorem in itself provides a 
proof that local hidden variables must conflict with quantum mechanics. 

It is important to note that what might appear to be Bell's assumption — the 
existence of definite values for all spin components — is identical to the conclusion 
of the spin singlet version of the Einstein-Podolsky-Rosen argument. In fact, 
Bell assumes nothing beyond what follows from EPR. Therefore the proper way 
to assess the implications of these arguments is to combine them into a single 
analysis that will begin with the assumptions of the spin singlet EPR paradox, 
and end with the conclusion of Bell's theorem; i.e., the assumption of locality 
leads to the conclusion of Bell's inequality. Since this inequality, as we saw, 
disagrees with the quantum theory, we finally have an argument that locality 
implies disagreement with the predictions of quantum, mechanics. This agrees 
with Bell's own assertion in the matter: [14] "It now seems that the non-locality 
is deeply rooted in quantum mechanics itself and will persist in any completion." 

We have now come to understand the issues of contextuality and nonlocality 
as features of a hidden variables interpretation of quantum theory. Contextual- 
ity in hidden variables is a natural feature to expect since it reflects the possibil- 
ity of distinct experimental procedures for measurement of a single observable. 
As J.S. Bell expresses it: [8] "The result of an observation may reasonably de- 
pend not only on the state of the system (including hidden variables) but also 
on the complete disposition of the apparatus." According to the above results, 
the fact that nonlocality is required of hidden variables does not in any way 
diminish the prospect of these types of theories. As we have mentioned, there 
has existed a successful theory of hidden variables since 1952, and there is no 
reason not to consider this theory (Bohmian mechanics) as a serious interpreta- 
tion of quantum mechanics. We noted in section 1.1.2 that Bohmian mechanics 
possesses the advantages of objectivity and determinism. 

In the fourth and final chapter, we addressed Erwin Schrodingcr's general- 
ization of the EPR paradox. Besides greatly extending the incompleteness argu- 
ment of EPR, Schrodingcr's analysis provides for a new set of "nonlocality with- 
out inequalities" proofs, which have several important features. The Schrodinger 
paradox concerns the perfect correlations exhibited not just by a single quantum 
state, but for a general class of states called the maximally entangled states. A 
maximally entangled state is any state of the form 

JV 

$^|<^„)®|V'n), (4.64) 

ra=l 

where {!</>«)} and {|V'n)} are bases of the (A/'-dimensional) Hilbert spaces of 
subsystems 1 and 2, respectively. As in the EPR analysis (both the spin sin- 
glet and the original version) Schrodinger gives an incompleteness argument. 
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according to which there must exist precise values for ah observables of both 
subsystems. It may be shown using Gleason's theorem, Kochen and Specker's 
theorem, Mcrmin's theorem, or any other 'spectral incompatibility' proof, that 
the definite values concluded in the Schrodinger's paradox must conflict with 
the empirical predictions of quantum mechanics. The implication of such a 
disagreement is quantum nonlocality. This conflict in empirical predictions dif- 
fers from that developed within Bell's theorem, in that it involves predictions 
for individual measurements, rather than the statistics of a series of measure- 
ments. Thus, combining Schrodinger's paradox with any spectral incompatibil- 
ity theorem provides a 'nonlocality without inequalities' proof. Moreover, as we 
observed in section 4.2.1, the conflict between the Schrodinger incompleteness 
and such a theorem exists even when one considers only the observables of one 
of the two subsystems. We have referred to this type of proof by the name 
"Schrodinger nonlocality." 

When we consider the experimental verification of Schrodinger nonlocality, 
we find a curious result. The measurement of a commuting set satisfying an 
equation of constraint may be carried out in such a way that the set of values 
obtained will satisfy this constraint a priori. Since the definite values concluded 
in the Schrodinger paradox cannot satisfy these constraining relationships, per- 
forming an experimental test using such measurement procedures provides that 
the perfect correlations themselves are sufficient to imply nonlocality. 

All this leads us to inquire how Schrodinger himself regarded his results, 
and what further conclusions he drew within his remarkable paper. Clearly, it 
would have been possible for him to argue for quantum nonlocality had he antic- 
ipated the results of any of a wide variety of theorems including at least Bell's 
(see section 4.1.4), Gleason's, Kochen and Specker's, Mermin's, or any other 
spectral incompatibility theorem. Instead, as we saw in chapter 1, Schrodinger 
essentially reproduced the von Neumann argument against hidden variables, in 
his observation of a set of observables that obey a linear relationship not satis- 
fied by the set's eigenvalues. What may be scon from von Neumann's rcsiilt is 
just what Schrodinger noted relationships constraining the observables do not 
necessarily constrain their values. Schrodinger continued this line of thought by 
speculating on the case for wliic;li no relation whatsoev(T constrained the values 
of the various observables. In light of his generalization of the EPR paradox, 
this line of thought led Schrodinger to the idea that the quantum system in 
question might possess an infinite number of degrees of freedom, which concept 
is actually quite similar to that of Bohmian mechanics. 

If, on the other hand, Schrodinger had made von Neumann's error, i.e., had 
concluded the impossibility of a map from observables to values, this mistaken 
line of reasoning would have permitted him to arrive at the concept of quan- 
tum nonlocality. Thus, insofar as one might regard the von Neumann proof as 
"almost" leading to the type of conclusion that follows from Gleason's theorem, 
one must consider Schrodinger as having come precisely that close to a proof of 
quantum nonlocality. 
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It is quite interesting to see that many of the issues related to quantum 
mechanical incompleteness and hidden variables were addressed in the 1935 
work [91, 92, 93] of Erwin Schrodingcr . Schrodinger's work seems to be the 
most far reaching of the early analyses addressed to the subject. Not only did he 
see deeper into the problem than did von Neumann, but Schrodinger developed 
results beyond those of the Einstein, Podolsky, Rosen paper — an extension of 
their incompleteness argument, and an analysis of incompleteness in terms of 
the implication of von Neumann's theorem. It seems clear that the field of 
foundations of quantum mechanics might have been greatly advanced had these 
features of Schrodinger's paper been more widely appreciated at the time it was 
first published. 
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